FOREWORD TO VOLUME 1


THE study of the physical properties of metals has developed through 
a number of stages. The first was that in which the mechanical 
properties were correlated empirically with the heat treatment to
which the metal had been subjected and, sometimes, with the chemical
composition. At this stage the successful treatment of metals was an
art, in the sense that experience rather than understanding led to the 
most satisfactory results. The next stage, in which the internal struc- 
ture of the metal was examined, was based originally on the use of the 
microscope and it was found that many experimental facts could be 
explained in terms of effects that were of the right size to be seen under 
magnifications of less than about two thousand. The development of 
the x-ray diffraction techniques allowed phenomena of a smaller order 
of magnitude to be examined, and much of the existing information 
was found to be comprehensible in terms of the geometry of the crystal 
structure of the various phases that were visible under the microscope. 

More recent development can perhaps best be discussed by a division 
of the field into what may be termed ‘statics’ and ‘dynamics’. Under 
the former heading is the study of the conditions which govern the 
structure of a metal or alloy when it is in thermodynamic and mechanical 
equilibrium. The theories of the phases that are present in equilibrium 
and of the elastic constants have made remarkable progress in terms of
the rapidly developing theory of the part played by electrons in the metal.

Under the heading ‘dynamic’ effects we may include both the 
conditions governing the approach to equilibrium in respect of the 
phases that are present, in which diffusion plays an important part, 
and the response of a metal to forces which are sufficient to cause 
non-recoverable or plastic mechanical deformation. 

These and associated subjects have advanced so rapidly that it has 
become difficult for research workers in one part of the field to remain 
up to date in other branches. It is the purpose of this volume, which 
is the first of an annual series, to present authoritative reviews of the 
present state of knowledge in specialized aspects of the field that includes 
both physical metallurgy and metal physics. It is not intended that any 
one volume should form a complete textbook on these subjects. It is 
hoped rather that a few subjects of current interest should be discussed 
rather fully so as to cover, in the course of several years, all the more 
important aspects in which progress is being made. In order to make 
the series reasonably self-contained it is proposed that the necessary
‘historical’ background should be included the first time a particular 
subject is discussed. Subsequent articles on such subjects will generally 
only cover the more recent progress. 

B. CHALMERS

INTERNAL FRICTION IN METALS


A. S. Nowick


THE capacity of a vibrating solid to convert its mechanical energy of 
vibration into heat, even when so well isolated that energy losses to its 
surroundings are negligible, is called internal friction. The most common 
manifestation of internal friction is the damping, or loss of vibration 
amplitude, of a freely vibrating body. Interest in internal friction in 
metals undoubtedly extends back to prehistoric times, to the earliest 
cymbals and bells. Modern investigation has followed two different 
channels. On the one hand there has been widespread interest in 
damping for engineering applications; depending on the particular 
application, one may wish to obtain either a very high or a very low 
rate of damping. In work of this kind, the effect of heat treatments and 
alloying elements on the damping characteristics of engineering materials 
is investigated and commonly reported in terms of a quantity called 
“damping capacity.” Vibration amplitudes employed are commen- 
surate with those that occur in practice. An adequate review of this 
phase of the subject would in itself require a sizeable article. A second 
approach to internal friction, with which the present review is con- 
cerned, is the use of damping as a tool to study internal structure and 
atomic movements in solids. This is a relatively new field of investiga- 
tion and has been developing rapidly. Interest is focused on the origin 
and mechanism of internal friction rather than its desirability, and the 
vibration amplitudes employed are very small. 

A perfectly elastic material will not produce damping since, under 
oscillatory conditions, stress and strain are always in phase, and 
consequently no mechanical hysteresis or loss of vibrational energy can 
take place. Damping is, therefore, a result of the non-elastic behaviour 
of materials. The fact that it can be observed at stress levels far below 
those at which plastic flow occurs, shows that there is really no such 
thing as an “elastic range.” In recent years there has been a great
increase in understanding of those physical processes that result in 
non-elastic behaviour, and, therefore, in damping, of metals at low 
stress levels. Review articles on this subject, written in the period
1939-1943, emphasize three main sources of damping: thermo-elasticity,
ferromagnetic effects, and the internal stresses associated
with cold work, or dislocations. Of these, the last was least understood,
in the sense that the mechanism by which these internal distortions act
to produce damping was not established. During the war and early
post-war period intensive investigations of internal friction, particularly
by ZENER and co-workers, brought to light other sources of internal
friction, including stress relaxation across grain boundaries and
stress-induced ordering. These last two mechanisms, together with
thermo-elastic internal friction, fitted into a scheme of non-elastic behaviour
which ZENER called anelasticity, the development of which culminated
in his book in 1948. In the treatment of anelasticity it is shown that
not only internal friction but several related static effects (recoverable
stress relaxation and elastic after-effect at low stress levels)
result from this type of non-elastic behaviour. Only internal friction
that does not depend on the amplitude of vibration is considered in this
treatment. The impact of this important work has been so great as to
lead to the widespread misconception that all internal friction at low
amplitudes can be explained in terms of anelasticity. The internal
friction originating in cold work is an appropriate example; theories
attempting to explain observed effects in terms of the scheme of
anelastic phenomena have not met with notable success. In fact, after
READ showed that cold work internal friction in single crystals is
strongly amplitude dependent, the writer demonstrated that this
behaviour originates in a mechanism of static hysteresis, which is quite
different from anelasticity.

In the present article, an attempt is first made to provide a
phenomenological description of the general features of non-elastic
deformation which is sufficiently general to include, as special cases, anelasticity
as well as static hysteresis and amplitude dependent internal friction.
The first three main sections deal with this phenomenological approach.
It is shown that internal friction cannot be divorced from an equally
important effect: the change in the effective elastic modulus resulting
from the non-elastic behaviour, which is called “the $\Delta M$ effect.”
Following the phenomenological treatment, the various experimental
methods used to measure internal friction are reviewed. The remainder
of the article is devoted to the physical origins of internal friction. All
sources of damping involve an internal re-arrangement that takes place
under stress. There are three types of re-arrangements that have been
studied: thermal (the thermo-elastic effect), magnetic, and atomic
re-arrangements. All of these are considered in the present article,
although emphasis is placed upon the last category, since most recent
developments are in this field and it is the least adequately reviewed.
Included in the category of atomic re-arrangements are diffusion effects,
relaxation across interfaces, and the motion of dislocations.


PHENOMENOLOGICAL THEORY: HOMOGENEOUS STRAIN

In most vibration experiments the specimen under investigation is
subjected to a non-homogeneous strain. A discussion of the elastic and
non-elastic behaviour of the specimen is, therefore, complicated by the
dependence of stress and strain on position in the specimen. In this 
section, the principles of non-elastic behaviour will be introduced under 
the assumption that we are dealing either with a small element of a 
specimen within which the strain is uniform, or with a complete speci- 
men throughout which the strain is the same at any instant. The latter 
condition can be realized, for a specimen of appropriate shape, by the 
introduction of an auxiliary inertia member relative to which the 
inertia of the specimen itself is negligible. For example, it is possible to 
set up a homogeneous shear strain by torsional vibration of a hollow 
tube to which is attached a heavy inertia member. 

For simplicity of notation, it is assumed that the stress system within
the specimen can be defined by a single component $\sigma$. Furthermore,
only the principal strain $\epsilon$, corresponding to the stress $\sigma$, need be
considered, since other strain components do not enter into expressions
for the work done by the stress. The symbol $M$ is used to represent the
appropriate elastic modulus that relates stress and strain.

As already mentioned, in order for internal friction to occur, the
oscillatory stress and strain must not be in phase with each other and,
in fact, in order that energy be dissipated, the strain must lag behind
the stress. Let us represent this angle of lag by $\phi$. Then $\phi$ is a
convenient measure of internal friction, since the energy dissipated in a
cycle of vibration goes to zero when $\phi$ approaches zero. It is convenient
to use complex notation to represent the various phase relationships.
Thus, the variation of stress with time is expressed as

$$\sigma = \sigma_0 e^{i\omega t} \tag{1}$$

where $\omega$ is the angular frequency of vibration ($\omega = 2\pi f$, where $f$ is the
number of vibrations per second) and $\sigma_0$ is the stress amplitude.
Correspondingly, the strain is

$$\epsilon = (\epsilon_1 - i\epsilon_2) e^{i\omega t} \tag{2}$$

where the quantities $\epsilon_1$ and $\epsilon_2$ are the amplitudes of the components
of strain in phase with and 90° behind the stress, respectively. Clearly

$$\tan\phi = \epsilon_2/\epsilon_1 \tag{3}$$

Throughout this article it will be assumed that the internal friction is
small, i.e. $\epsilon_2 \ll \epsilon_1$ and $\tan\phi \approx \phi$. This approximation greatly simplifies
the relations between various measures of internal friction, and there
have been very few experiments to which it does not apply. An
appropriate definition for the dynamic modulus, $M$, is

$$M = \sigma_0/\epsilon_1 \tag{4}$$


It is also convenient to define the complex modulus, $\tilde{M}$, as the ratio of
stress to strain, using the complex notation of equations (1) and (2).
Thus,

$$\tilde{M} = \frac{\sigma_0}{\epsilon_1 - i\epsilon_2} = \frac{\sigma_0}{\epsilon_1(1 - i\tan\phi)} \approx M(1 + i\phi) \tag{5}$$

where the approximation of small $\phi$ has been introduced. The real part
of the complex modulus is $M$ and the ratio of the imaginary part to the
real part is the phase angle $\phi$, which is our present measure of internal
friction. The complex modulus, therefore, contains the essential
parameters pertaining to the vibration.
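
The small-damping relation between the complex modulus, the dynamic modulus, and the phase angle can be checked numerically. The following is a minimal Python sketch and not part of the original treatment; the function name `complex_modulus` and the numerical values are illustrative assumptions.

```python
import math

def complex_modulus(sigma0, eps1, eps2):
    """Return (complex modulus, dynamic modulus, phase angle).

    sigma0 : stress amplitude
    eps1   : strain amplitude in phase with the stress
    eps2   : strain amplitude lagging the stress by 90 degrees
    """
    m_complex = sigma0 / complex(eps1, -eps2)   # sigma0 / (eps1 - i*eps2)
    m_dynamic = sigma0 / eps1                   # equation (4)
    phi = math.atan2(eps2, eps1)                # equation (3)
    return m_complex, m_dynamic, phi

# Illustrative (assumed) values giving a small internal friction.
m_c, m_d, phi = complex_modulus(sigma0=1.0e6, eps1=1.0e-5, eps2=1.0e-8)
approx = m_d * complex(1.0, phi)                # small-damping form M(1 + i*phi)
relative_error = abs(m_c - approx) / abs(m_c)   # second order in phi
```

With these values the difference between the exact ratio and $M(1 + i\phi)$ is of second order in $\phi$, which is why it is neglected throughout.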

The occurrence of non-elastic behaviour means more than the fact
that $\phi \neq 0$. In order to discuss the concept of non-elastic behaviour we
must consider the law that describes the interrelation of stress and
strain, not only under conditions of oscillation, but also under static
conditions, e.g. the application of a constant stress. A perfectly elastic
substance is one for which the strain is uniquely determined by the
stress at any instant, i.e. a monotonic functional relationship exists
between the two quantities. This function only deviates from Hooke’s
law of proportionality when the strain is large. Since we are concerned
entirely with small deformations, a simple law of proportionality may
always be used to represent the stress-strain equation for a perfectly
elastic substance. A material that shows non-elastic behaviour is one for
which the stress-strain equation is not a simple proportionality but
assumes a generalized form, in which, for example, there may be terms
involving time derivatives of stress and strain or terms that represent
multiple-valued functional relationships. For self-consistency with the
requirement of small damping, we may limit ourselves to stress-strain
equations where all these non-elastic terms are very much smaller than
the elastic terms. If the stress-strain equation is linear in stress, strain,
and time derivatives of these quantities, it can be solved by the
substitution of equations (1) and (2) to obtain the complex modulus directly.
If, as a result of the non-elastic behaviour, terms which are nonlinear
appear in the stress-strain equation, an exact solution can no longer be
obtained by the substitution of a simple harmonic stress and strain, but
requires complete Fourier series for these quantities, i.e. stress and
strain may be periodic but not sinusoidal. An approximate solution
can be obtained if we ignore everything but the fundamental Fourier
components of stress and strain and determine the complex modulus
from these quantities. The parameters $M$ and $\phi$ obtained from the real
and imaginary parts of this complex modulus will depend on the
amplitude of vibration, as a consequence of the nonlinear terms. We then
speak of “amplitude dependent internal friction” and, in general, also
of an “amplitude dependent dynamic modulus.” This method, which
converts a nonlinear problem into an approximately equivalent linear
problem with parameters dependent on the amplitude of vibration, is
known as the method of equivalent linearization. The approximation
is a good one when the nonlinear terms are small, a condition which is
certainly valid in the present case, since all nonlinear terms are among
the small non-elastic terms.
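
As an illustration of equivalent linearization, the sketch below uses a hypothetical nonlinear law, $\sigma = M\epsilon + \eta\dot{\epsilon} + c\dot{\epsilon}^3$. The cubic term and the constant $c$ are assumptions chosen only for this example and are not a law proposed in the text. For brevity the strain, rather than the stress, is taken as sinusoidal, which is a further simplifying assumption. Only the fundamental Fourier component of the resulting stress is retained, and the complex modulus is formed from it.

```python
import math

def fundamental_modulus(M, eta, c, omega, eps0, n=2000):
    """Complex modulus from the fundamental Fourier component of the stress.

    Hypothetical law (an assumption for illustration only):
        sigma = M*eps + eta*deps + c*deps**3,  eps = eps0*cos(omega*t)
    """
    a = 0.0  # running sum for the cosine coefficient of the stress
    b = 0.0  # running sum for the sine coefficient of the stress
    for j in range(n):
        x = 2.0 * math.pi * j / n                 # phase omega*t over one cycle
        eps = eps0 * math.cos(x)
        deps = -omega * eps0 * math.sin(x)        # strain rate
        sigma = M * eps + eta * deps + c * deps ** 3
        a += sigma * math.cos(x)
        b += sigma * math.sin(x)
    a *= 2.0 / n
    b *= 2.0 / n
    # Fundamental stress a*cos(x) + b*sin(x) = Re[(a - i*b) e^{ix}],
    # while the strain is Re[eps0 e^{ix}], so the ratio is the complex modulus.
    return complex(a, -b) / eps0

small = fundamental_modulus(M=1.0, eta=1.0e-3, c=1.0, omega=1.0, eps0=0.01)
large = fundamental_modulus(M=1.0, eta=1.0e-3, c=1.0, omega=1.0, eps0=0.02)
phi_small = small.imag / small.real   # internal friction at the smaller amplitude
phi_large = large.imag / large.real   # larger, because of the nonlinear term
```

For this particular law the imaginary part works out analytically to $\eta\omega + \tfrac{3}{4}c\,\omega^3\epsilon_0^2$, so the equivalent linear problem has an internal friction that grows with the square of the strain amplitude.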

We will now consider two well-known theories of non-elastic behaviour
which lead to very simple stress-strain equations.


The Voigt Solid 

In order to account for non-elastic behaviour of materials, VOIGT
made the assumption (previously introduced by Stokes and others)
that, in addition to the elastic stress $M\epsilon$ there is a frictional resistance
proportional to the rate of strain $\dot{\epsilon}$. We, therefore, obtain the
stress-strain equation

$$\sigma = M\epsilon + \eta\dot{\epsilon} \tag{6}$$

This is a linear equation and, therefore, leads to an amplitude
independent internal friction. A material that can be described in terms of
equation (6) is called a Voigt solid. The complex modulus is
$\tilde{M} = M + i\omega\eta$, so that the internal friction is given by

$$\phi = (\eta/M)\omega \tag{7}$$

The internal friction of a Voigt solid is, therefore, proportional to the
frequency of vibration and its dynamic modulus is a constant
(independent of frequency).

The Voigt assumptions are most often used in textbooks on theoretical
physics to discuss the damped oscillations of a mass point attached
to a weightless spring. The purpose of such treatments is usually to 
show how internal friction manifests itself in free and forced vibrations. 
No attention is usually paid to the question of the frequency dependence 
of internal friction. 


The Maxwell Solid

An alternative description of non-elastic behaviour is due to MAXWELL.
Rather than use the nebulous concept of an internal frictional force,
this approach first assumes that the strain is made up of two parts:

$$\epsilon = \epsilon' + \epsilon'' \tag{8}$$

where $\epsilon'$ is the perfectly elastic strain which obeys Hooke’s law

$$\epsilon' = \sigma/M' \tag{9}$$

and $\epsilon''$ is the non-elastic strain which results from internal re-arrangement
and is not proportional to the stress. Equation (9) is to be regarded as
the definition of $M'$. MAXWELL further assumes that the non-elastic
term $\epsilon''$ obeys the Newtonian viscosity equation, i.e. the flow $\dot{\epsilon}''$ is
proportional to the stress, or

$$\dot{\epsilon}'' = \beta\sigma \tag{10}$$

PROGRESS IN METAL PHYSICS


The stress-strain equation is obtained by combining equations (8), (9)
and (10):

$$\dot{\epsilon} = \dot{\sigma}/M' + \beta\sigma \tag{11}$$

From this equation and the assumption of small internal friction, we
obtain

$$M = M'; \qquad \phi = \beta M'/\omega \tag{12}$$

Again the dynamic modulus is a constant, but now $\phi$ is inversely
proportional to the frequency. The amplitude independence of $\phi$ is
related to the linearity of equation (11).
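
The opposite frequency dependences of the two models can be seen in a few lines. The sketch below is illustrative only: it assumes the Voigt law $\sigma = M\epsilon + \eta\dot{\epsilon}$ and the Maxwell flow law $\dot{\epsilon}'' = \beta\sigma$, with $\beta$ denoting the constant of proportionality, and all function names and numerical values are hypothetical.

```python
def voigt_modulus(M, eta, omega):
    """Complex modulus of a Voigt solid, M + i*omega*eta."""
    return complex(M, omega * eta)

def maxwell_modulus(M_prime, beta, omega):
    """Complex modulus of a Maxwell solid.

    For sinusoidal stress the strain is sigma*(1/M' - i*beta/omega),
    so the modulus is the reciprocal of that complex compliance.
    """
    compliance = complex(1.0 / M_prime, -beta / omega)
    return 1.0 / compliance

def internal_friction(m_complex):
    """Ratio of the imaginary to the real part of the complex modulus."""
    return m_complex.imag / m_complex.real

# Assumed values, chosen so that the damping is small in both models.
for omega in (100.0, 200.0, 400.0):
    phi_voigt = internal_friction(voigt_modulus(1.0e11, 1.0e5, omega))
    phi_maxwell = internal_friction(maxwell_modulus(1.0e11, 1.0e-13, omega))
    print(omega, phi_voigt, phi_maxwell)
```

Doubling the frequency doubles the Voigt value and halves the Maxwell value, while the real part of the modulus is unchanged to first order in the internal friction in both cases.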

It has been known for many years that the experimentally observed
frequency dependence of internal friction, for most materials, is in
agreement with neither the VOIGT nor the MAXWELL descriptions of
non-elastic behaviour. Another shortcoming of these descriptions is the 
prediction, in both cases, that the dynamic modulus of the system is 
unaffected by the non-elastic behaviour, except for terms of second order 
in the internal friction which are omitted here. Experimental observa- 
tions show that the occurrence of internal friction is often accompanied 
by a fractional decrease in the dynamic modulus, of the same order of 
magnitude as the internal friction. 

Various methods have been suggested for the generalization of the 
Voigt and Maxwell solids to obtain a more widely applicable theory. 
These methods regard the properties of an actual solid as a super- 
position of those of a large collection (finite or infinite) of Voigt or of 
Maxwell solids each with different parameters. In this way it is possible 
to introduce into the description of the non-elastic behaviour of a 
material as many arbitrary parameters as are necessary to describe its
observed behaviour. Such superposition methods have been particularly
popular in the study of mechanical properties of high polymers.
They are limited, however, to the treatment of materials whose internal
friction is independent of the amplitude of vibration and, also, cannot
be used for the study of static hysteresis (described in a later section).
It will, therefore, be more suitable for us to introduce a different
approach, which is sufficiently general to include all phenomena that
lead to internal friction in metals.


The Generalized Maxwell Approach


The first assumption of the Maxwell description is that a material
under stress may undergo internal re-arrangement leading to a
non-elastic strain $\epsilon''$ in addition to the elastic strain $\epsilon'$. This assumption,
expressed by equations (8) and (9), is very generally applicable to
non-elastic phenomena. On the other hand, we will drop the highly
restrictive second assumption expressed by equation (10) and, for the present,
leave unspecified the equation that describes the non-elastic strain $\epsilon''$.
Without such an equation, we cannot obtain the stress-strain relation
required to complete the description of the behaviour of a material.
Possible equations for $\epsilon''$ will be discussed in a later section; at present,
phase relationships will be discussed in general terms.


Fig. 1. Phase relationships between the stress and the elastic
and non-elastic strain


If the stress (or its fundamental Fourier component) is represented
by equation (1), the elastic strain is

$$\epsilon' = \epsilon_1' e^{i\omega t} \tag{13}$$

The non-elastic strain $\epsilon''$ will, in general, be resolvable into a component
in phase with the stress and one lagging by 90°, or

$$\epsilon'' = (\epsilon_1'' - i\epsilon_2'') e^{i\omega t} \tag{14}$$

where the quantities $\epsilon_1'$, $\epsilon_1''$ and $\epsilon_2''$ are appropriate amplitudes and are
real and positive quantities. Fig. 1 shows the phase relationships
between stress and components of strain. This diagram is not drawn to
scale, since the assumption of small internal friction and non-elastic
effects means that $\epsilon'' \ll \epsilon'$. With this approximation $\epsilon_1 \approx \epsilon_1'$ (to
first order), and equation (3) becomes

$$\phi = \epsilon_2''/\epsilon_1' \tag{15}$$


The internal friction is, to the first approximation, dependent only on
the component of $\epsilon''$ out of phase with stress. The result is not surprising
since only the component of strain out of phase with the stress will
contribute to energy dissipation. The dynamic modulus $M$ is given by
(see equation (4))

$$M = \frac{\sigma_0}{\epsilon_1' + \epsilon_1''} \approx M'(1 - \epsilon_1''/\epsilon_1') \tag{16}$$

where $M'$ is defined by equation (9). Thus, since the presence of
non-elastic behaviour may increase the in-phase component of strain, the
dynamic modulus is less than (or equal to) $M'$, and the difference,
$\Delta M = M' - M$, is given by

$$\Delta M/M = \epsilon_1''/\epsilon_1' \tag{17}$$

This lowering of the elastic modulus because of the existence of a
non-elastic strain will be called the $\Delta M$ effect. This discussion has, therefore,
shown that internal friction and the $\Delta M$ effect are closely related
consequences of the occurrence of non-elastic strain, the first resulting from
the component out of phase with stress, and the second from the
component in phase with the stress. It is also clear that if the equation,
thus far unspecified, that describes $\epsilon''$ is nonlinear, the ratios $\epsilon_1''/\epsilon_1'$ and
$\epsilon_2''/\epsilon_1'$ will be functions of strain amplitude. Specific forms for this
equation will be considered in the section called “Viscoelasticity and
Static Hysteresis.”
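
The two consequences of a non-elastic strain can be separated numerically. The sketch below is a minimal illustration under assumed values: the function name `nonelastic_measures` is hypothetical, and the inputs are the stress amplitude, the modulus $M'$, and the in-phase and lagging amplitudes of the non-elastic strain.

```python
def nonelastic_measures(sigma0, M_prime, eps1_ne, eps2_ne):
    """Return (phi, dM/M) for given non-elastic strain amplitudes.

    eps1_ne : non-elastic strain amplitude in phase with the stress
    eps2_ne : non-elastic strain amplitude lagging the stress by 90 degrees
    """
    eps1_el = sigma0 / M_prime                 # elastic strain amplitude, Hooke's law
    phi = eps2_ne / eps1_el                    # internal friction, first order
    M = sigma0 / (eps1_el + eps1_ne)           # dynamic modulus from the total in-phase strain
    dM_over_M = (M_prime - M) / M              # modulus defect relative to M
    return phi, dM_over_M

# Assumed values: non-elastic strains about a thousandth of the elastic strain.
phi, dM_over_M = nonelastic_measures(
    sigma0=1.0e6, M_prime=1.0e11, eps1_ne=2.0e-8, eps2_ne=1.0e-8
)
```

Changing only `eps2_ne` alters the internal friction and leaves the modulus defect untouched, while changing only `eps1_ne` does the reverse, which is the separation described above.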


Measures of Internal Friction 


The relationship between various measures of internal friction, as well
as of the $\Delta M$ effect, will now be considered. As long as the discussion
of non-elastic behaviour is restricted to the use of quantities such as the
complex modulus and phase angle $\phi$, it is possible to treat in the same
way, either a small element of material or a complete specimen under
homogeneous strain. If quantities such as the logarithmic decrement
and the width of a resonance peak are to be considered, it is necessary to
deal with an entire specimen in a particular mode of vibration. For
simplicity, we will discuss here only specimens vibrating under
homogeneous strain, and in the next section will show how the present
discussion is generalized when the strain is not homogeneous. It is required,
then, that a relatively large inertia member be attached to the specimen
under consideration. If $\theta$ represents the displacement of the inertia
member, $I$ its “inertia,” $F_s$ and $F_a$ the “force” exerted upon this member
by the specimen and the externally applied “force,” respectively, then
the equation of motion of this member is

$$I\ddot{\theta} = F_s + F_a \tag{18}$$

(In the example of the torsion of a hollow tube mentioned earlier, $\theta$ is
simply the angle of twist, $I$ the moment of inertia, and the quantities
$F$ are torques. On the other hand, if the vibration is under a longitudinal
load, $\theta$ becomes the longitudinal displacement, $I$ the attached mass,
and $F$ a force.) The quantity $\theta$ is essentially the strain, $\epsilon$, throughout
the specimen, except for a geometric constant of proportionality,
while $F_s$ is proportional to the stress $\sigma$, but opposite in sign. Inasmuch
as $\sigma = \tilde{M}\epsilon$ (by the definition of $\tilde{M}$, equation (5)), with $\tilde{M} = M(1 + i\phi)$,
we may also write

$$F_s = -\tilde{k}\theta, \qquad \tilde{k} = k(1 + i\phi) \tag{19}$$


where $\tilde{k}$ is a complex “force constant.” Equation (18) may then be solved
in the case of forced vibrations with an impressed angular frequency $\omega$,
i.e. when the applied “force” is taken as sinusoidal of the form
$F_a = F_0 e^{i\omega t}$. Under steady state conditions, the equation will have a
solution of the form $\theta = \theta_0 e^{i\omega t}$, where $\theta_0$ is, in general, complex. (If the
stress-strain equation is nonlinear, $\theta$ will actually be representable by a
complete Fourier series, but as mentioned earlier, we will ignore all but
the fundamental Fourier components.) When this solution and
equation (19) are substituted into equation (18), an equation for $\theta_0$
is obtained:

$$\theta_0 = \frac{F_0/I}{(\omega_r^2 - \omega^2) + i\omega_r^2\phi} \tag{20}$$

where

$$\omega_r^2 = k/I \tag{21}$$

The real quantity $|\theta_0|^2$ is given by

$$|\theta_0|^2 = \frac{(F_0/I)^2}{(\omega_r^2 - \omega^2)^2 + \omega_r^4\phi^2} \tag{22}$$

This is the well-known equation for a resonance curve (a plot of $|\theta_0|^2$
versus $\omega$) with a resonant angular frequency (at which the amplitude of
vibration is a maximum) equal to $\omega_r$. If the frequencies at which
$|\theta_0|^2$ falls to half the maximum value are denoted by $\omega_1$ and $\omega_2$ it
follows that

$$(\omega_2 - \omega_1)/\omega_r = \phi \tag{23}$$

The internal friction $\phi$ may, therefore, be obtained in forced vibration as
the width of the resonance curve (square of displacement
amplitude versus frequency) at half maximum divided by the resonant
frequency. Inasmuch as the reciprocal of this ratio is called the “Q”
of an electrical circuit, the symbol $Q^{-1}$ has been adopted by some
authors as a measure of internal friction; from equation (23), $Q^{-1}$ is
equal to the phase angle $\phi$ between stress and strain. The study of
internal friction by means of a resonance curve is complicated when the
internal friction is amplitude dependent. When $\phi$ is itself a function of
$|\theta_0|$, and in fact (as commonly observed experimentally) an increasing
function of $|\theta_0|$, the height of the resonance peak will be determined
by the value of $\phi$ at maximum amplitude, but the peak is broader than
for the case of constant internal friction. Although the resonance peak
is broadened by the amplitude dependence of $\phi$, it may still be
symmetrical about $\omega = \omega_r$ (to terms of first order in the internal friction) so
long as there is no $\Delta M$ effect. In general, however, amplitude dependent
internal friction is accompanied by an amplitude dependent
dynamic modulus $M$. This means that $k$ is a function of amplitude and,
therefore, that $\omega_r$, as well as $\phi$, in equation (22), is dependent on $|\theta_0|$.
In the usual case, where the $\Delta M$ effect increases (and, therefore, $\omega_r$
decreases) with increasing amplitude, it is readily shown from equation
(22) as well as experimentally that the resonance curve falls off faster
on the low frequency side than on the high frequency side. If the
amplitude dependence of $\omega_r$ is sufficiently large, this asymmetry may become
so great that the resonance curve bends over on itself and becomes
double valued on the low frequency side. Thus, discontinuities may be
observed in tracing this curve experimentally.
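
For the amplitude independent case, the half-width relation can be verified directly from the resonance curve. The sketch below is illustrative only: it evaluates the squared amplitude in the form of equation (22), locates the two half-maximum frequencies by bisection, and compares the fractional width with $\phi$. The function names and numerical values are assumptions made for the example.

```python
def amplitude_sq(omega, omega_r, phi, f0_over_i=1.0):
    """Squared displacement amplitude of the resonance curve, equation (22)."""
    return f0_over_i ** 2 / (
        (omega_r ** 2 - omega ** 2) ** 2 + omega_r ** 4 * phi ** 2
    )

def half_width_ratio(omega_r, phi):
    """Return (omega_2 - omega_1)/omega_r for constant internal friction phi."""
    half = 0.5 * amplitude_sq(omega_r, omega_r, phi)   # half of the peak value

    def crossing(lo, hi):
        # Bisection: the curve crosses the half-maximum level once in [lo, hi].
        f_lo = amplitude_sq(lo, omega_r, phi) - half
        for _ in range(200):
            mid = 0.5 * (lo + hi)
            f_mid = amplitude_sq(mid, omega_r, phi) - half
            if (f_mid > 0) == (f_lo > 0):
                lo, f_lo = mid, f_mid
            else:
                hi = mid
        return 0.5 * (lo + hi)

    omega_1 = crossing(0.5 * omega_r, omega_r)   # low frequency side
    omega_2 = crossing(omega_r, 1.5 * omega_r)   # high frequency side
    return (omega_2 - omega_1) / omega_r

ratio = half_width_ratio(omega_r=1.0, phi=1.0e-3)   # close to phi for small damping
```

The computed ratio differs from $\phi$ only by terms of third order in $\phi$, consistent with the small-damping assumption. The sketch does not attempt the amplitude dependent case, in which $\phi$ and $\omega_r$ would themselves vary along the curve.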

When amplitude dependent effects are studied by forced vibration
methods, it is usually preferable to work at exactly the resonant
frequency where, from equation (22), the amplitude at resonance
$|\theta_0^m|$ is

$$|\theta_0^m| = F_0/(I\omega_r^2\phi) \tag{24}$$

It is necessary to determine the maximum amplitude $|\theta_0^m|$ and the
force $F_0$ in order to obtain $\phi$; both $\phi$ and $\omega_r$ are then known at the
specific amplitude $|\theta_0^m|$. By variation of $F_0$ it is then possible to
obtain these values at any other displacement amplitude. Whereas, in
principle, this method is a very simple one for determination of the
internal friction and $\Delta M$ effect as a function of amplitude, the quantities
$F_0$ and $|\theta_0^m|$ are often difficult to obtain experimentally. In the
piezoelectric method (to be discussed in a subsequent section) these quantities
are readily calculated, so that this method has been particularly suitable
to the study of amplitude dependent internal friction. In other
methods of forced vibration, it is more convenient to keep $F_0$ constant
and determine $Q^{-1}$ (equation (23)), which requires only a knowledge of
relative amplitudes and does not require that $F_0$ be known. The latter
methods are, therefore, only useful when internal friction is independent
of amplitude.

In addition to the possibility for studying internal friction by the
determination of the resonance curve in forced vibrations, the oldest
and most popular methods are related to determination of the damping
of free vibrations, where vibrations, once excited, are allowed to continue,
with no external forces applied to the system; the rate of decrease of
amplitude is then determined. In order to solve the equation of motion
(18) for $F_a = 0$ (no external forces), we substitute a solution of the form

$$\theta = \theta_0 e^{i\tilde{\omega} t}, \qquad \tilde{\omega} = \omega(1 + i\delta/2\pi)$$

which represents exponentially damped oscillations, if $\delta$ is a constant.
($\theta_0$ is now real.) When this solution is substituted into equation (18),
and cognizance taken of equation (19), we obtain: $I\tilde{\omega}^2 = \tilde{k}$; if real and
imaginary parts are now separately equated, and second order terms in
internal friction neglected, it is found that

$$\omega^2 = k/I; \qquad \delta = \pi\phi \tag{25}$$


The quantity $\delta$ represents the natural logarithm of the ratio of
amplitudes in two successive vibrations and is called the logarithmic decrement
(or simply, the “decrement”). When this quantity is small (as always
assumed in this discussion) it is also equal to the fractional decrease in
vibrational amplitude per cycle. In order to obtain $\delta$ from experiment,
it is simply necessary to plot the logarithm of the amplitude of vibration
against the number of cycles of vibration. If internal friction is
independent of amplitude a straight line is obtained whose slope gives
the decrement directly. However, if internal friction is amplitude
dependent, this curve will not be a straight line, but its slope at any
amplitude will give the decrement at that particular amplitude. Thus
an entire curve of decrement versus amplitude of vibration may be
obtained from a single decay curve if the precision of the data is
sufficiently good.
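
The procedure of plotting the logarithm of amplitude against the number of cycles amounts to a straight-line fit. The sketch below is a minimal illustration for the amplitude independent case, using synthetic (assumed) decay data; the function name `decrement_from_decay` is hypothetical.

```python
import math

def decrement_from_decay(amplitudes):
    """Logarithmic decrement from successive cycle amplitudes.

    Fits a least-squares straight line to ln(amplitude) versus cycle
    number and returns the negative of its slope.
    """
    n = len(amplitudes)
    xs = range(n)
    ys = [math.log(a) for a in amplitudes]
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return -sxy / sxx

# Synthetic decay with an assumed internal friction of 2e-3.
phi_true = 2.0e-3
delta_true = math.pi * phi_true
amps = [math.exp(-delta_true * cycle) for cycle in range(50)]

delta = decrement_from_decay(amps)
phi = delta / math.pi          # phase angle recovered from the decrement
```

For amplitude dependent internal friction the same fit would have to be applied to short stretches of the decay curve, so that each local slope gives the decrement at the corresponding amplitude.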

Equations (21) and (25) also show that the natural frequency in free
vibration and the resonant frequency $\omega_r$ in forced vibration are the
same. This resonant frequency is affected by the existence of a $\Delta M$
effect; under conditions where non-elastic behaviour does not occur,
its value would be $\omega' = (k'/I)^{1/2}$, where $k'$ is the force constant
corresponding to the modulus $M'$. Because of the $\Delta M$ effect, there will be a
decrease in resonant frequency and, if we define $\Delta\omega = \omega' - \omega_r$, we
find that

$$\Delta\omega/\omega_r = \tfrac{1}{2}\Delta M/M \tag{26}$$

Thus, the existence of a $\Delta M$ effect is synonymous with a decrease in the
resonant frequency or in the frequency of free vibration. Also, as
already noted, amplitude dependent internal friction is usually
accompanied by an amplitude dependent dynamic modulus. This effect is
detected by a change of resonant frequency, or of the frequency of free
vibration, with a change in the amplitude of vibration.
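
The factor of one half between the fractional frequency shift and the fractional modulus defect follows from the square-root dependence of frequency on the force constant. The sketch below checks this numerically; it assumes that the force constant is proportional to the modulus for a fixed specimen geometry, and the function name and values are hypothetical.

```python
import math

def fractional_frequency_shift(k_prime, inertia, dm_over_m):
    """Return (omega' - omega_r)/omega_r for a given modulus defect dM/M.

    Assumes the force constant scales with the modulus, so that
    k'/k = M'/M = 1 + dM/M.
    """
    k = k_prime / (1.0 + dm_over_m)            # force constant with the defect present
    omega_prime = math.sqrt(k_prime / inertia)  # frequency without non-elastic behaviour
    omega_r = math.sqrt(k / inertia)            # actual resonant frequency
    return (omega_prime - omega_r) / omega_r

shift = fractional_frequency_shift(k_prime=1.0, inertia=1.0, dm_over_m=1.0e-3)
first_order = 0.5 * 1.0e-3                      # half the modulus defect
```

The exact shift and the first-order value agree to within terms of second order in the modulus defect.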

Another measure of internal friction is the fractional decrease in
vibrational energy per cycle. Inasmuch as $\delta$ is the fractional decrease
in amplitude per cycle, and the vibrational energy is proportional to the
square of the amplitude, it follows that

$$\delta = \Delta W/2W$$

where $W$ is the vibrational energy and $\Delta W$ is the loss of vibrational
energy in one cycle. A closely related quantity which is widely used in
engineering applications is the damping capacity, which is the per cent
decrease in vibrational energy per cycle, so that damping capacity
$= 200\delta$.

The relations between the various measures of internal friction are
summarized as follows:

$$\phi = \delta/\pi = \Delta W/2\pi W = Q^{-1}$$

The last equality applies only when internal friction is amplitude
independent. These simple relationships between the various measures
of internal friction are derived under the assumption of small damping
and break down when this assumption is no longer valid. For large
damping, the relations between different measures of internal friction
depend on the mechanism of the damping and must be derived separately
for each case.
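
The small-damping relations summarized above lend themselves to a simple conversion routine. The sketch below is illustrative only and is valid only under the small-damping assumption; the function name and dictionary keys are hypothetical, and the last entry presumes amplitude independent internal friction.

```python
import math

def measures_from_phi(phi):
    """Convert a small phase angle phi into the other measures of damping."""
    decrement = math.pi * phi                 # logarithmic decrement
    energy_loss = 2.0 * decrement             # fractional energy loss per cycle
    return {
        "phi": phi,
        "decrement": decrement,
        "fractional_energy_loss": energy_loss,
        "damping_capacity_percent": 100.0 * energy_loss,   # equals 200 * decrement
        "q": 1.0 / phi,                       # reciprocal of Q^-1, amplitude independent case
    }

summary = measures_from_phi(1.0e-3)
```

A phase angle of one thousandth of a radian thus corresponds to a damping capacity of roughly 0.6 per cent.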

The measured values of $\phi$ and $\Delta M/M$ will not only depend upon the
frequency and possibly the amplitude of vibration, but will also depend 
on the type of stress employed. The relation between values obtained 
in shear and in longitudinal vibration, for example, will depend on the 
mechanism of the non-elastic behaviour. 


PHENOMENOLOGICAL THEORY: NON-HOMOGENEOUS STRAIN* 


In practice, damping is rarely, if ever, studied under conditions where 
the mass is all concentrated in an auxiliary inertia member, and the 
stress and strain are homogeneous throughout the “elastic” member.
Very often, the material whose elastic and non-elastic properties are to
be studied will be in the form of a bar which is vibrated in a longitudinal,
transverse, or torsional mode of oscillation. Under these circumstances 
the stress amplitude is not constant throughout the specimen. It will 
be shown in this section, however, that the interpretation of vibration 
experiments on these more realistic specimens is completely analogous 
to the case of homogeneous strain, except that special attention is 
required when the internal friction is amplitude dependent. 

It is well known that a perfectly elastic system will vibrate in
certain normal modes of vibration characteristic of the system. Once it
is vibrating in one normal mode it will continue to do so indefinitely.
If small non-elastic effects are introduced, it is to be expected that the
system will still vibrate in the same normal mode, except that now it
will diminish in amplitude due to energy losses. Such behaviour will
actually occur when the stress-strain equation is linear; in the case
where nonlinear terms occur, the solution of the equation of motion is
no longer a single normal mode of vibration. It is useful, however, to
consider only the components of the vibration in a single normal mode,
just as we considered only fundamental Fourier components in the case
of homogeneous strain when a nonlinear stress-strain equation occurred.
This extension of the method of equivalent linearization to vibration
under non-homogeneous strain has been justified experimentally over
a range of amplitudes in which the internal friction reaches twice its
value at low amplitudes.


* The reader who is not interested in the details of this section may omit the section
without loss of continuity.

In most of the experimental methods a simple stress system is
involved. We may, therefore, continue to describe the vibration in terms
of a single component of stress $\sigma$ and the corresponding principal strain
$\epsilon$. When the system vibrates in one particular normal mode, the strain $\epsilon$
may be written in the form


$$\epsilon = C(t)\,G(x, y, z) \qquad (28)$$


where the space dependent function $G$ is characteristic of the mode of
vibration (an eigenfunction of the vibration equation). By means of this
separation of variables, the problem of vibration in a normal mode is
reduced to a one-dimensional problem similar to that of homogeneous
strain, since only the co-ordinate $C$, which specifies the extent of
excitation, is needed in order that the strain at every point in the specimen
be known. A system vibrating very nearly in a normal mode, under
small damping, may, therefore, be represented by an equivalent simple
vibrator, whose equation of motion is of the form of equation (18), where the
quantity $C$ is now the displacement. For a perfectly elastic system,
$C = C_0 e^{i\omega t}$; on the other hand, if the specimen displays non-elastic
behaviour, the quantity $C$ will show damping in free vibration or a resonance
curve in forced vibration. It is necessary to relate the damping $\phi_s$ of the
equivalent vibrator, which represents the behaviour of the specimen as a
whole, and the dynamic modulus $M_s$, obtained from the resonant
frequency of the specimen, to the corresponding properties $\phi$ and $M$ of
an element of specimen material. If the stress-strain equation is linear
and the quantities $M$ and $\phi$ are, therefore, independent of vibration
amplitude, we may anticipate that $\phi_s = \phi$ and $M_s = M$. If, on the
other hand, $M$ and $\phi$ are functions of the strain amplitude $\epsilon_0$ within a
small element of the specimen, the quantities $\phi_s$ and $M_s$ will depend
upon the amplitude of vibration $C_0$ of the specimen.

To obtain the relations between the quantities that represent the
behaviour of the specimen as a whole and those that represent the
properties of the specimen material, it is convenient to employ the
notation of the generalized Maxwell approach introduced in the last
section. The quantity $C$ may be divided, in the usual way, into $C'$ and
$C''$, so that the parameters characteristic of the specimen are:


$$\phi_s = C_2''/C_0; \qquad \Delta M_s/M_s = C_1''/C_0 \qquad (29)$$


where $\Delta M_s = M' - M_s$, by analogy to the definition of $\Delta M$. Now,
consider an element of the specimen for which the total strain is $\epsilon$ and
non-elastic strain $\epsilon''$. The internal friction


$$\phi(\epsilon_0) = \epsilon_2''/\epsilon_0 \qquad (30)$$


is, in general, a function of the strain amplitude $\epsilon_0$. If the specimen is
assumed to be vibrating very nearly in a single normal mode, as in
equation (28), we have

$$\epsilon_2'' = C_2''G; \qquad \epsilon_0 = C_0 G$$


If these last two expressions are inserted into equation (30), we obtain


$$C_2''G = C_0 G\,\phi(C_0 G) \qquad (31)$$


This equality cannot be strictly true unless $\phi$ is a constant (since
dividing through by $G$ gives a space dependent function on the right and a
constant on the left, if $\phi \neq$ constant), i.e. as already stated, it is not
strictly correct to assume that only a single mode of vibration is
involved when $\phi$ is not constant. Inasmuch as any function of
co-ordinates of the specimen can be expanded in terms of a set of
eigenfunctions, we may take components of the two sides of equation (31)
corresponding to the normal mode $G$. To do this, we multiply both sides
by $G$ and integrate over the volume of the specimen to obtain

$$\phi_s(C_0) = \frac{\int \phi(C_0 G)\,G^2\,d\tau}{\int G^2\,d\tau} \qquad (32)$$

where $d\tau$ represents an element of volume of the specimen. Equation
(32) relates the experimentally measured function $\phi_s(C_0)$ to the function
$\phi(\epsilon_0)$, which is a property only of the specimen material. The equation
shows, as we have already anticipated, that only in the amplitude
independent case is the internal friction measured on a specimen vibrating
under non-homogeneous strain independent of the mode of vibration,
i.e. dependent only on the properties of the specimen material. In
exactly the same way, the result

$$\frac{\Delta M_s}{M_s} = \frac{\int (\Delta M/M)\,G^2\,d\tau}{\int G^2\,d\tau} \qquad (33)$$

is obtained.
In summary, it has been shown in this section that the only difference
in the analysis of the non-elastic behaviour of a specimen vibrating in a
normal mode and one for which the strain is homogeneous throughout is
that in the case of non-uniform strain the measured values of internal
friction and the dynamic modulus represent a weighted average (as
given in equations (32) and (33)) of the corresponding properties of the
specimen material, when these properties are amplitude dependent.
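The weighted average of equation (32) is easily evaluated numerically. The sketch below is an illustration under stated assumptions rather than part of the original treatment: the material law (internal friction rising linearly with local strain amplitude), its coefficients, and the one-dimensional mode shape $G = \sin(\pi x)$ (taken as the strain distribution of the fundamental longitudinal mode of a free-free bar of unit length) are all hypothetical choices.

```python
import math

def specimen_internal_friction(phi, mode_shape, c0, n=20000):
    """Midpoint-rule estimate of the G^2-weighted average of phi(c0 * G) on 0 <= x <= 1."""
    num = 0.0
    den = 0.0
    for i in range(n):
        x = (i + 0.5) / n
        g = mode_shape(x)
        num += phi(abs(c0 * g)) * g * g   # local internal friction weighted by G^2
        den += g * g
    return num / den

# Hypothetical material law: phi = A + B * (local strain amplitude).
A, B = 1.0e-4, 20.0
material_phi = lambda eps0: A + B * eps0

# Assumed strain distribution of the fundamental longitudinal mode.
mode = lambda x: math.sin(math.pi * x)

c0 = 1.0e-5                                   # strain amplitude at the antinode
phi_s = specimen_internal_friction(material_phi, mode, c0)
phi_at_antinode = material_phi(c0)
print(f"specimen value: {phi_s:.4e}   material value at antinode: {phi_at_antinode:.4e}")
```

For this mode shape the amplitude-dependent part of the measured value is reduced by the factor $8/3\pi$ relative to the value at the strain antinode, while a constant $\phi$ is returned unchanged, in line with the conclusion above.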


VISCOELASTICITY AND STATIC HYSTERESIS


In the phenomenological treatment of internal friction given so far, 
MAXWELL’s first assumption (equations (8) and (9)) has been used, 
while his second assumption (equation (10)) has been eliminated on the 
grounds that it is too restrictive for a description of the non-elastic 
behaviour of most materials. It has been possible to introduce the $\Delta M$
effect and to show the relationship between the various measures of 
internal friction without substituting any other assumptions in place 
of equation (10). In order to obtain any specific information, such as 
the variation of internal friction with frequency or amplitude of 
vibration, it is obviously necessary to introduce expressions which give 
further information about the non-elastic strain. 


The “Standard Linear Solid”


A very important type of non-elastic behaviour is shown by materials
that have been called “standard linear solids.” Such materials may
also be said to show a “single time of relaxation.” The basis for the
behaviour to be described is the existence of an internal parameter $p$
which takes on a different equilibrium value $\bar{p}$ at each value of the
stress and whose variation is accompanied by a non-elastic strain $\epsilon''$.
The following three conditions are assumed to apply: (1) The
non-elastic strain is proportional to $p$,


$$\epsilon'' = \lambda p \qquad (34)$$


(2) The equilibrium value of $p$, denoted by $\bar{p}$, is proportional to the
stress. (3) The parameter $p$ approaches its equilibrium value $\bar{p}$, under
constant stress, according to the relaxation equation,


$$\dot{p} = -(p - \bar{p})/\tau \qquad (35)$$


so that the difference $p - \bar{p}$ goes to zero exponentially, with relaxation
time $\tau$. From conditions (1) and (2), the equilibrium non-elastic strain
$\bar{\epsilon}''$ $(= \lambda\bar{p})$ is proportional to stress, and, therefore, to the elastic
strain $\epsilon'$, or

$$\bar{\epsilon}'' = \Delta_M \epsilon' = \Delta_M \sigma/M' \qquad (36)$$


where the dimensionless proportionality constant $\Delta_M$ is called the
relaxation strength and must be much less than unity if our assumption
of small damping is to be valid. When equations (34), (35) and (36) are
combined they give, as an equation for $\epsilon''$,

$$\dot{\epsilon}'' = -(1/\tau)(\epsilon'' - \Delta_M \epsilon') \qquad (37)$$


This result replaces equation (10) in the Maxwell approach and is
essentially the same as that equation except for the presence of an
extra term on the right-hand side. When a harmonic solution is
substituted into equation (37), there results


$$\epsilon''/\epsilon' = \Delta_M/(1 + i\omega\tau) \qquad (38)$$


Fig. 2. Frequency dependence of internal friction and $\Delta M$ effect for a
standard linear solid. The functions are plotted for unit
relaxation strength, $\Delta_M$


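When equations (13) and (14) are substituted for $\epsilon''$ and $\epsilon'$ and the real
and imaginary parts of equation (38) separately equated, the internal
friction and $\Delta M$ effect are obtained directly:


$$\phi = \Delta_M \frac{\omega\tau}{1 + \omega^2\tau^2} \qquad (39)$$

$$\frac{\Delta M}{M} = \frac{\Delta_M}{1 + \omega^2\tau^2} \qquad (40)$$


or, for the dynamic modulus itself,


$$M = M'\left[1 - \frac{\Delta_M}{1 + \omega^2\tau^2}\right] \qquad (41)$$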
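The frequency dependence expressed by equations (39) and (40) can be tabulated directly. The following sketch assumes those equations in the form $\phi = \Delta_M\,\omega\tau/(1 + \omega^2\tau^2)$ and $\Delta M/M = \Delta_M/(1 + \omega^2\tau^2)$; the function name and the relaxation strength of 0.01 are illustrative choices only.

```python
import math

def standard_linear_solid(omega_tau, strength):
    """Return (phi, dM_over_M) for a single relaxation time."""
    denom = 1.0 + omega_tau ** 2
    return strength * omega_tau / denom, strength / denom

strength = 0.01  # hypothetical relaxation strength, small compared to unity
for wt in (0.01, 0.1, 1.0, 10.0, 100.0):
    phi, dm = standard_linear_solid(wt, strength)
    print(f"omega*tau = {wt:7.2f}   phi = {phi:.3e}   dM/M = {dm:.3e}")
```

The table produced in this way shows the symmetry of the internal friction about $\omega\tau = 1$ on a logarithmic frequency scale and the fall of $\Delta M/M$ from $\Delta_M$ at low frequencies towards zero at high frequencies.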


Because of the assumptions of linear relationships (equations (34) and
(36)) the internal friction and $\Delta M$ effect turn out to be independent of
vibration amplitude. The internal friction curve is a symmetrical peak
when $\phi$ is plotted against $\log(\omega\tau)$, as shown in Fig. 2, with a maximum
value equal to $\Delta_M/2$ at $\omega\tau = 1$. The interpretation for the occurrence
of such a peak is as follows: the internal parameter $p$ will constantly
strive to attain the appropriate equilibrium value $\bar{p}$ at every value of
stress, i.e. the relaxation process (equation (35)) is constantly going
on. When the period of vibration is very short compared to $\tau$, or
$\omega\tau \gg 1$, practically no relaxation is possible during a cycle because
the stress is changing rapidly compared to the time for achievement
of equilibrium. Under these conditions $\epsilon''$ is practically equal to
zero, the behaviour of the material is essentially perfectly elastic, and,
therefore, $\Delta M = 0$, in agreement with equation (40) and Fig. 2.
The dynamic modulus $M$ that is measured at sufficiently high
frequencies is, therefore, equal to $M'$; it is also known as the unrelaxed
modulus, inasmuch as it is measured under conditions where no
relaxation occurs. Fig. 3(a) shows the proportionality between stress and
strain at high frequencies. The other extreme, $\omega\tau \ll 1$, corresponds
to a period of vibration so much longer than $\tau$ that there is sufficient
time for achievement of very nearly equilibrium values of the parameter
$p$ at every value of stress. Under these conditions the strain at every
value of stress is $\epsilon' + \bar{\epsilon}''$, which is a unique function of stress. As
illustrated in curve (b) of Fig. 3, the stress-strain relationship is again a
straight line and, therefore, $\phi$ is essentially zero; the dynamic modulus
$M$, which is the slope of this straight line, is lower than $M'$, and, in fact,
is equal to $M'(1 - \Delta_M)$ (from equation (41)). This value is known as the
relaxed modulus, since it is measured under conditions where relaxation
is always essentially complete. Between these two extreme cases, stress
and strain will not be unique functions of one another, since relaxation
cannot keep up with alternations in stress. The stress-strain curve is then
an ellipse whose area attains a maximum value when $\omega\tau = 1$. The slope of
the major axis of the ellipse is the dynamic modulus $M$ and always falls
between the two extreme values of Fig. 3. The variation of the quantity
$\Delta M/M$ with frequency is shown in Fig. 2, which is a plot of equation
(40). It is interesting that only in the range where damping is high is
there an appreciable dependence of the dynamic modulus on frequency.
If measurements are made only at low frequencies, there is no way of
distinguishing the behaviour of the specimen from that of a perfectly
elastic substance whose modulus is $M'(1 - \Delta_M)$.

Fig. 3. Stress-strain curves in a standard linear solid: (a) at very
high frequencies, (b) at very low frequencies. In both cases stress
and strain are in phase and there is no damping

The standard linear solid subjected to a given type of vibration is
completely described by three independent constants, $M'$, $\Delta_M$ and $\tau$,
in contrast to the one constant, the elastic modulus, of a perfectly
elastic body. The appropriate modulus depends on the type of
experiment and in most cases will be either Young's modulus $E$ or the rigidity
modulus $G$. The relaxation strength $\Delta_M$ is a measure of the magnitude
of the non-elastic effects, i.e. the height of the internal friction peak and
the value of $\Delta M/M$ at very low frequencies. It is a quantity that should
be small compared to unity in order that the assumptions made in
deriving the relationship between different measures of internal friction
be valid. Actual observed values of $\Delta_M$ have ranged from 10~* to
greater than $10^{-1}$. The latter value may seem to violate our requirement
of $\Delta_M \ll 1$, but the error in the application of the above theory to the case
of $\Delta_M = 0.1$ is not serious. The reason for designating the appropriate
modulus as a subscript to $\Delta$ is that the relaxation strength is dependent
upon the type of vibration experiment. For example, in many cases
relaxation is produced only by shear stress. Thus, if a vibration
experiment were conducted under hydrostatic stress only, we would have
$\Delta_K = 0$ ($K$ = bulk modulus). In a torsion experiment, $\Delta_G$ is involved
and in tension or flexure $\Delta_E$. However, since in tension only the shear
components would produce relaxation, the relation between $\Delta_G$ and $\Delta_E$
is readily obtained from elasticity theory as


$$\Delta_G = (3G/E)\,\Delta_E \qquad (42)$$


For the usual values of $G/E$, $\Delta_G$ is about 15 per cent greater than $\Delta_E$.
The third independent constant of the standard linear solid is the
relaxation time $\tau$. This parameter is essentially the mean time for the
appropriate internal re-arrangements to occur which will lead to the
attainment of the equilibrium value of the internal parameter $p$. In
the important case where a change in $p$ occurs by atomic re-arrangement
or diffusion, we have at our disposal a valuable method for obtaining the
mean atomic jump time. This application of internal friction
measurements will be discussed later. Whenever atomic movements of any kind
are involved in the relaxation process it is to be expected that $\tau$ will be
strongly temperature dependent, according to the Arrhenius equation,


$$\tau = \tau_0 e^{H/RT} \qquad (43)$$

where $H$ is a heat of activation, $R$ is the gas constant, and $T$ the absolute
temperature. The existence of such a relationship can be used to great
advantage when it is desired to obtain the internal friction peak
experimentally. If this peak is to be obtained by varying the frequency
it requires measurements over two or three powers of ten in $\omega$ (see Fig. 2).
Such measurements usually are experimentally difficult. Inasmuch as
$\phi$ is dependent only on the product $\omega\tau$, the same end can be achieved if
$\phi$ is observed as a function of $\tau$ (by varying the temperature) at constant
$\omega$ as by a study of $\phi$ as a function of $\omega$ at constant $\tau$. A similar curve to
that of Fig. 2 may, therefore, be obtained by plotting $\phi$ against $T^{-1}$
when $\tau$ obeys equation (43). This curve will still have a maximum when
$\omega\tau = 1$, except that $\tau$ is now the variable. Curves of $\phi$ against $T^{-1}$
obtained at two different frequencies $\omega_1$ and $\omega_2$ can be superposed by a
horizontal shift, and from the magnitude of this shift the heat of
activation $H$ can be calculated as follows: for the same value of $\phi$,
$\omega_1\tau_1 = \omega_2\tau_2$, where the subscripts 1 and 2 refer to the two curves. Therefore


$$\omega_1 e^{H/RT_1} = \omega_2 e^{H/RT_2}$$

or

$$\ln(\omega_2/\omega_1) = \frac{H}{R}\,\Delta(1/T) \qquad (44)$$


so that from the experimentally observed shift, $\Delta(1/T) = 1/T_1 - 1/T_2$, between the
curves for frequencies $\omega_1$ and $\omega_2$, the quantity $H$ is determined.
Equation (44) is more generally valid than the case of a standard linear
solid, since its derivation is dependent only on the fact that $\phi$ is a
function of $\omega$ and $\tau$ through the product $\omega\tau$, and not on the form of
that function.
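The use of equation (44) may be illustrated numerically. In the sketch below the Arrhenius law is assumed in the form $\tau = \tau_0 e^{H/RT}$, the shift is taken as $\Delta(1/T) = 1/T_1 - 1/T_2$, and the values of $\tau_0$, $H$ and the two frequencies are hypothetical; the peak temperatures are generated from the condition $\omega\tau = 1$ and the heat of activation is then recovered from the shift.

```python
import math

R = 8.314  # gas constant, J/(mol K)

def activation_energy(omega1, t1, omega2, t2):
    """Heat of activation from corresponding points (omega1, T1) and (omega2, T2)."""
    return R * math.log(omega2 / omega1) / (1.0 / t1 - 1.0 / t2)

def peak_temperature(omega, tau0, h):
    """Temperature at which omega * tau = 1 when tau = tau0 * exp(H / RT)."""
    return h / (R * math.log(1.0 / (omega * tau0)))

# Hypothetical relaxation: tau0 = 1e-14 s, H = 80 kJ/mol, measured at 1 and 10 c/s.
tau0, h_true = 1.0e-14, 80.0e3
w1, w2 = 2.0 * math.pi * 1.0, 2.0 * math.pi * 10.0
t1 = peak_temperature(w1, tau0, h_true)
t2 = peak_temperature(w2, tau0, h_true)
h_est = activation_energy(w1, t1, w2, t2)
print(f"T1 = {t1:.1f} K   T2 = {t2:.1f} K   H = {h_est:.0f} J/mol")
```

Because only the product $\omega\tau$ enters, the same calculation applies to any pair of corresponding points on the two curves, not merely to the peaks.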


Superposition of Relaxation Times 


Although there are several physical mechanisms in metals and other 
solids that lead to non-elastic behaviour describable by equation (37), a 
material whose internal friction curve is given precisely by equation (39), 
over the entire frequency range, has never been found. On the one hand 
one may find more than one such peak, resulting from different mecha- 
nisms and located at different positions on the frequency scale. Thus, 
an acoustic or relaxation spectrum is said to exist, somewhat in analogy
to the optical spectrum. On the other hand, any single peak of the 
relaxation spectrum may deviate to a lesser or greater extent from the 
theoretical curve of equation (39). The reason for this deviation is 
related to fluctuations in the time of relaxation at different positions in 
the specimen. For example, if the relaxation process is controlled by 
atomic diffusion, the time of relaxation will be closely related to the 
mean atomic jump time. This latter quantity is capable of fluctuating
in accordance with fluctuations in the number of impurity atoms and in
the state of internal stress in the immediate neighbourhood of the atoms
under consideration. Usually, therefore, the term $\epsilon''$ will be made up of
a series of terms each obeying equation (37) but with slightly different
values of $\tau$. The observed internal friction and $\Delta M$ effect may then be
a superposition of terms of forms given by equations (39) and (40), with
different values of $\tau$. The existence of a range of relaxation times, no
matter how narrow, in place of a single relaxation time, will always result
in a broadening of the internal friction peak. Equation (39), therefore,
serves as a theoretical limit which is very closely approached in some
cases (e.g. interstitial diffusion in b.c.c. lattices) while in other cases
(such as grain-boundary internal friction) considerably wider peaks are
observed. These examples will be discussed more fully in subsequent 
sections. 
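The broadening produced by a range of relaxation times can be seen in a simple numerical sketch. Each term is assumed to have the single-relaxation form $\Delta\,\omega\tau/(1 + \omega^2\tau^2)$; the particular spectrum used (five relaxation times spread over one decade, with equal strengths adding up to the same total) is a hypothetical choice made only for illustration.

```python
import math

def debye(omega_tau, strength):
    """Internal friction of a single relaxation."""
    return strength * omega_tau / (1.0 + omega_tau ** 2)

def superposed_phi(omega, taus, strengths):
    """Sum of single-relaxation terms, one per relaxation time."""
    return sum(debye(omega * t, d) for t, d in zip(taus, strengths))

total = 0.01
single_peak = debye(1.0, total)  # height of the single-relaxation peak, total / 2

# Hypothetical spectrum: five relaxation times spread over a decade, equal weights.
taus = [10.0 ** (k / 4.0) for k in range(-2, 3)]
strengths = [total / len(taus)] * len(taus)

# Scan six decades of frequency on a logarithmic grid.
omegas = [10.0 ** (k / 100.0) for k in range(-300, 301)]
curve = [superposed_phi(w, taus, strengths) for w in omegas]
broad_peak = max(curve)
print(f"single relaxation: {single_peak:.3e}   superposition: {broad_peak:.3e}")
```

With the total strength held fixed, the superposed peak is lower than $\Delta_M/2$ and correspondingly wider, since the contributions no longer reach their maxima at the same frequency.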

When the observed internal friction is the result of the superposition
of mechanisms which have a range of relaxation times, but for which each
value of $\tau$ obeys equation (43), there are two possibilities that must be
considered: (1) the various values of $\tau$ differ from each other only in
the values of $\tau_0$; (2) different values of $H$ are involved. The first case
(probably illustrated by grain-boundary internal friction) occurs when
the various values of $\tau$ are each different multiples of a unit jump time
$\tau_j$, which itself obeys an Arrhenius equation. Such a situation might
arise if, in different parts of a material, relaxation is produced by the
occurrence of a different number of atomic jumps, each jump requiring
a mean time $\tau_j$. In this case, the internal friction curve, no matter what
its form, is still dependent on frequency and temperature through the
product $\omega e^{H/RT}$. The arguments leading to equation (44), therefore, still
apply and the value of $H$ may be obtained from the horizontal shift of
the curve of $\phi$ versus $T^{-1}$ with a change of frequency. In the second
case, in which $H$ is not constant, a change of frequency produces a change
in the shape of the internal friction curve, since its various components
are shifted by different amounts. If the change of shape is not too great,
an average value of $H$ may be obtained from the average horizontal
shift.

Anelasticity

The material that shows a single time of relaxation and one which is
describable in terms of a superposition of relaxation times are the most
important examples of that field of non-elastic behaviour that ZENER
has called “anelasticity.” In order that an imperfectly elastic solid be
said to show anelasticity two conditions are required: (1) the
stress-strain equation should be linear; (2) if a static stress is applied to such a
material and then removed, no permanent set shall remain, i.e. any
residual strain that exists after the stress is removed shall decay back
to zero if sufficient time is allowed. As an example, in the case of the
standard linear solid, if equations (8) and (9) are combined with (37) it is
possible to eliminate $\epsilon'$ and $\epsilon''$ to obtain a linear equation in $\epsilon$, $\dot{\epsilon}$, $\sigma$, and
$\dot{\sigma}$ (the stress-strain equation). (This equation may be generalized by
including higher derivatives of $\epsilon$ and $\sigma$.) Anelasticity excludes
amplitude dependent effects, for as long as condition (1) applies, the internal
friction and $\Delta M$ effect will be independent of amplitude. As for
condition (2) in the case of the standard linear solid, it follows from the
defining assumptions that there must be only one equilibrium value of
strain for every value of stress. It is readily shown that the Maxwell
solid does not obey the second condition for anelasticity, although it
clearly satisfies the first condition. (If the stress is brought to zero after
non-elastic deformation has occurred, equation (10) states that $\dot{\epsilon}'' = 0$,
i.e. there is no recovery of the non-elastic strain in the Maxwell solid.)


Coupled Relaxations 

It is possible that if several sources of relaxation with different values
of $\tau$ are operating simultaneously, the internal friction may not be a
simple superposition of the values that would be found if each were 
acting separately. If the relaxation of one source affects the conditions 
at the second, coupling may be said to occur and the resultant relaxa- 
tion may be considerably greater than that obtained by superposition. 


That such coupling of relaxations can occur has been noted recently in
connexion with an irregular arrangement of interfaces in a metal. The 
phenomenon of coupled relaxations, which also falls into the category 
of anelasticity, will be further discussed in a subsequent section, in 
relation to the mechanisms that produce it. 


General Viscoelastic Behaviour 

Although most studies of the mechanism of internal friction in metals 
have thus far centred about anelastic effects, the two conditions for 
anelasticity have been too restrictive to be as widely applicable in the 
study of high polymers. Workers in the latter field have found that
in many cases, it is not possible to give adequate description to the
mechanical properties of polymeric substances in terms of linear
equations. They have also found that equations which call for viscous
flow and which, therefore, permit the occurrence of a permanent set, 
are also necessary to interpret the experimental behaviour of these 
materials. Any material which can be described in terms of a differential 
equation in stress, strain, and the time derivatives of stress and strain, 
whether or not the equation is linear or implies the possibility of a 
permanent set, is called viscoelastic. In metal physics the occurrence 
of viscoelastic behaviour outside of anelasticity has not been widely 
reported, probably because extensive enough investigations have not
been made and also because most investigations have been conducted at
low temperatures where permanent set is less likely to occur. In one
investigation of internal friction at relatively high temperatures, it
has been established that the mechanism of damping is one that 
corresponds to a viscous-type flow under static stress. 

On the basis of present experimental evidence it seems as if any 
viscoelastic behaviour in metals (even the type that produces a per- 
manent set) does not result in strongly amplitude dependent internal 
friction at the low strain amplitudes at which most experiments are 
conducted. The maximum strain amplitude ordinarily employed is of 
the order 10~-° or lower. The reason for this independence of amplitude 
below strains of 10-° is probably that even where a nonlinear equation 
is involved, it can usually be approximated adequately by linear 
expressions, at these low amplitudes. Amplitude dependent internal 
friction plays a more important role in the case of static hysteresis, 
which will now be discussed. 


Static Hysteresis 

In this section, it will be shown that a material that manifests no 
viscoelastic behaviour may still behave non-elastically and produce 
internal friction and a AM effect, as a result of a mechanism entirely 
different from viscoelasticity. 

Internal friction can only occur when stress and strain are not 
unique functions of one another in vibration, that is, when the dynamic 
stress-strain curve is a hysteresis loop. In viscoelastic materials, 
hysteresis arises as a consequence of the appearance of time derivatives 
in the stress-strain equation coupled with the fact that the experiment 
is conducted under oscillatory stresses. The occurrence of the hysteresis 
loop is a consequence of the dynamic nature of the experiment and the 
area of the loop is very much dependent on frequency. In the case of an 
anelastic material, for example, if an experiment is conducted statically 
with stress applied and released sufficiently slowly to keep the material 
constantly in equilibrium, the area of the hysteresis loop will go to 
zero; stress and strain will be uniquely related. We have already seen 
such behaviour in curve (b) of Fig. 3. Hysteresis may also arise, in
materials that are not viscoelastic, in the following way: suppose that 
as the stress is increased there is a definite value of strain for each stress, 
this strain being attained instantaneously and, therefore, independently 
of the rate of loading. Now, when a particular maximum strain is 
attained, the stress is released. Again it is assumed that at each value of 
the decreasing stress the strain instantaneously adjusts itself to an 
appropriate value, but the strain, for a given stress during unloading, is 
not the same as the corresponding value during loading. A permanent 
residual deformation remains after removal of the stress which can only
be brought back to zero strain by the application of a reversed load.
This type of hysteresis loop, illustrated in Fig. 4, is the same whether it
is traversed statically or dynamically; it is entirely independent of the
frequency of vibration. The stress-strain equation, therefore, does not
involve time derivatives, as in the case of viscoelasticity, but is, simply,
a multivalued function. This phenomenon is well known in the case of
large-scale plastic deformation at low temperatures where the
deformation is practically instantaneous, so that the amount of deformation is
almost independent of the rate of loading. It was with regard to such
hysteresis, obtainable in large-scale plastic flow, that BURGERS
suggested the term static hysteresis in contrast to the dynamic hysteresis
displayed by viscoelastic materials. The fact that static hysteresis
occurs for stresses greater than those required to start plastic flow may
be important in fatigue studies, but certainly not in internal friction at
very low amplitudes. It has been known for some time, however, that
ferromagnetic substances, because of the phenomenon of
magnetostriction, will follow a mechanical hysteresis loop that is independent of
frequency and which gives rise to internal friction at low amplitudes.
Recently it has been demonstrated that damping related to the
motion of dislocations in a single crystal originates from a static
hysteresis loop at strain amplitudes as low as $10^{-7}$.* It, therefore, appears that
static hysteresis may occur at exceedingly low amplitudes, and that it
may result from atomic as well as magnetic re-arrangements. Obviously,
such re-arrangements do not actually occur instantaneously, but are
probably propagated with the velocity of sound. Such velocities are
sufficiently “instantaneous” at frequencies normally used in vibration
experiments.

In contrast to viscoelastic internal friction, that originating in static
hysteresis seems to show a marked amplitude dependence even at very
low amplitudes. Such observations are not at all surprising because the
static hysteresis loop will, in general, deviate strikingly from a linear
relationship (as, for example, Fig. 4).

Fig. 4. Schematic illustration of a static
hysteresis loop. The same loop is obtained
regardless of the rate of traversal


* The writer in his original description of this behaviour called it “simple hysteresis.”

In order to calculate ¢ and AM/M for a static hysteresis loop, it is 
convenient to consider only the loop of non-elastic strain «” versus 
stress. Let us represent the slope of these curves, de"/do, by* 


de" da = F(a) (loading) 
= f(a) (unloading) Ek eee 


If ¢ = o, cos wt, where o, is the amplitude, e” is periodic but in general 
not simple harmonic. It is desired, in the usual way, to take the funda- 
mental Fourier components of «” in phase with and 90° behind the 
stress. These components, e,” and e,” respectively, may be obtained by 
Fourier analysis (after partial integration to obtain integrals of de" /do 
rather than of e”) and inserted into the expressions (15) and (17) to give 


2M’ (7 
— | 







= (F(a, cos 6) — f(a, cos 6)]} cos 9 sin 6d6. . . . (46) 
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In the derivation of equations (46) and (47) it is assumed that F and / 
are even functions, i.e. the two halves of the hysteresis loop are identical. 
Such an assumption will generally be valid under steady-state condi- 
tions. In the case of the parallelogram loop, where F and f are constants, 
the quantities ¢ and AM/M are independent of the stress amplitude 
:» In the more general case, it may be convenient to use power series 
expansions for the functions F and /: 
F(a) = Ay + A,\o| + Ao? +. . -) 


(F(a, cos 6) + f(¢, cos 6)] sin* 6d6 : ols: 6 ee 









ao 







(438) 






f(a) = ag + a,\0| + a,o*7 +... 






The absolute value |¢| appears in the linear term because F and f 
are even functions. When these expressions are substituted into 
equations (46) and (47) and the appropriate integrals evaluated, the 
amplitude dependence of ¢ and AM/M may be obtained as power series 








in the strain amplitude 


M 
d — [(Ag— aq) + 3M(A, — a,)e, + $.M*%(A,— a,)e,7 +... | 
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M’ | (Ag + aq) + § — (A, + ae, + 4A, + a, )e? + . . | 









(49) 


where we have substituted Mε₀ = σ₀. In general, if there is no reason
to expect that the stress-strain curve is a parallelogram loop, the
occurrence of static hysteresis is closely associated with
amplitude-dependent effects even at very low amplitude. This characteristic
amplitude dependence, together with the frequency independence of
internal friction resulting from static hysteresis, makes it possible to
distinguish this type of behaviour from viscoelasticity.

* It is assumed, in this treatment, that the functions F and f are independent of
the stress amplitude. This assumption is a restrictive one, and the question of its
validity can only be determined when the mechanism of the non-elastic behaviour is
known.

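The amplitude dependence described above can be checked numerically. The following is a minimal sketch, assuming that equations (46) and (47) take the forms φ = (2M/π)∫[F − f] cos θ sin θ dθ and ΔM/M = (2M/π)∫[F + f] sin²θ dθ over 0 ≤ θ ≤ π/2, with F and f evaluated at σ₀ cos θ, and that (49) is the corresponding series for polynomial F and f. The function names and all coefficient values are hypothetical and serve only for illustration.

```python
import math

def integrate(g, a, b, n=20000):
    """Composite midpoint rule for the integral of g over [a, b]."""
    h = (b - a) / n
    return h * sum(g(a + (i + 0.5) * h) for i in range(n))

def loss_and_defect(F, f, sigma0, M):
    """Evaluate phi (46) and dM/M (47) for loading/unloading slopes F and f."""
    def phi_integrand(t):
        s = sigma0 * math.cos(t)
        return (F(s) - f(s)) * math.cos(t) * math.sin(t)

    def dm_integrand(t):
        s = sigma0 * math.cos(t)
        return (F(s) + f(s)) * math.sin(t) ** 2

    pref = 2.0 * M / math.pi
    return (pref * integrate(phi_integrand, 0.0, math.pi / 2),
            pref * integrate(dm_integrand, 0.0, math.pi / 2))

def series_49(A, a, eps0, M):
    """Power series (49), truncated after the quadratic term."""
    x = M * eps0  # equals the stress amplitude sigma0
    phi = (M / math.pi) * ((A[0] - a[0]) + (2.0 / 3.0) * (A[1] - a[1]) * x
                           + 0.5 * (A[2] - a[2]) * x ** 2)
    dm = (M / 2.0) * ((A[0] + a[0]) + (4.0 / (3.0 * math.pi)) * (A[1] + a[1]) * x
                      + 0.25 * (A[2] + a[2]) * x ** 2)
    return phi, dm

# Hypothetical coefficients in arbitrary but consistent units.
M = 1.0
A = (0.02, 0.5, 3.0)   # loading slope F(sigma) = A0 + A1|sigma| + A2 sigma^2
a = (0.01, 0.2, 1.0)   # unloading slope f(sigma), same form
eps0 = 0.05

def F(s):
    return A[0] + A[1] * abs(s) + A[2] * s * s

def f(s):
    return a[0] + a[1] * abs(s) + a[2] * s * s

numeric = loss_and_defect(F, f, M * eps0, M)
analytic = series_49(A, a, eps0, M)
print("numerical:", numeric)
print("series   :", analytic)
```

With constant F and f (the parallelogram loop) the same routine returns values that do not change with `eps0`, which is the amplitude independence noted in the text.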
Summary of Phenomenological Theory 
The various mechanisms that produce damping may be classified into 
the following categories of non-elastic behaviour : 
I. Viscoelasticity (dynamic hysteresis)
   A. Anelasticity
      1. Single time of relaxation (standard linear solid).
      2. Superposition of relaxations.
      3. Coupled relaxations.
   B. Processes involving viscous flow.
II. Static hysteresis.

More complex behaviour, which may be represented as a combination
of viscoelasticity and static hysteresis, is also possible but has not been
investigated as yet.


METHODS OF MEASUREMENT 


The methods most commonly used in the measurement of internal 
friction will be classified and the general principles discussed briefly 
in this section. The specific variations of each general method have 
been too numerous to make a detailed discussion of them possible here. 


The Torsion Pendulum 

This general type of method is essentially distinct from the others to be 
discussed, in that it is a low frequency method. The use of an auxiliary 
inertia member of relatively large moment of inertia compared to that 
of the specimen to which it is attached, usually results in a low torsional 
frequency of vibration, in the range 0.1 to 15 vibrations per second.
The damping is determined by observation of the rate of free decay, 
i.e. the logarithmic decrement (equation (25)), usually by means of an 
optical method, where a beam of light is reflected by a mirror mounted 
on the inertia member and viewed on a translucent scale at an appro- 
priate distance from the mirror. The initial excitation of vibrations is 
usually accomplished by making part of the inertia member out of a 
ferromagnetic material and suitably placing an electromagnet so as to 
attract this part when a switch is closed to activate the magnet. 
If the frequency of vibration is less than about 2 sec⁻¹ it is possible
to obtain the amplitude as a function of number of vibrations either 
visually or by automatic (usually photographic) recording; at higher 
frequencies the automatic method must be used. 
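As an illustration of how a free-decay record is reduced, the sketch below computes the logarithmic decrement from a series of successive peak amplitudes. It assumes the usual definition δ = (1/N) ln(A₀/A_N) and the small-damping relation Q⁻¹ ≈ δ/π; neither is quoted from this section, and the scale readings are hypothetical.

```python
import math

def log_decrement(amplitudes):
    """Mean logarithmic decrement per cycle from successive peak amplitudes."""
    cycles = len(amplitudes) - 1
    return math.log(amplitudes[0] / amplitudes[-1]) / cycles

def local_decrements(amplitudes):
    """Decrement between each pair of successive peaks.

    A systematic trend in these values along the record indicates
    amplitude-dependent damping.
    """
    return [math.log(a0 / a1) for a0, a1 in zip(amplitudes, amplitudes[1:])]

# Hypothetical scale readings (mm) for eleven successive swings.
readings = [100.0 * math.exp(-0.02 * k) for k in range(11)]

delta = log_decrement(readings)
q_inverse = delta / math.pi   # small-damping approximation (assumed)
print(delta, q_inverse)
```

Evaluating the decrement pair by pair, rather than over the whole record, is one simple way of extracting the amplitude dependence of the damping from a single decay curve.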

The torsion pendulum method is the oldest and simplest method for
measuring internal friction. This method has a long history, dating
back to COULOMB at the end of the eighteenth century, and having
recently come into wide use in such modifications as those of
CHEVENARD and BOULANGER (the Coulomb micropendulum),
KÊ, and Frive. The torsion pendulum of KÊ has been imitated by
several authors and used in most of the recent measurements at low
frequencies. It requires a specimen in wire form, usually 6 to 12 in. long
and about 0.03 in. in diameter. The Coulomb micropendulum, on the
other hand, uses a specimen about 4 in. long and 0.06 in. in diameter.
Automatic recording methods have been used by several
experimenters.³⁴,³⁵

The strain amplitude in a cylindrical specimen of circular cross 
section, vibrating in a torsion pendulum, is variable, from a maximum 
value at the surface to zero at the cylinder axis. The maximum 
amplitude of shear strain in the specimen is usually of the order 10~-°. 
The torsion pendulum is particularly useful for large values of damping
up to values of φ of the order 0.1, while the lower limit is not much
below 10-*, even when the chamber is evacuated to eliminate air
damping effects. The study of materials that show very low damping,
of the order 10-°, has not been possible, thus far, with the torsion
pendulum because of the background damping due to losses in the
apparatus.

It is possible to obtain the amplitude dependence of internal friction 
from a decay curve in free vibration, as already pointed out in the last 
section. The dependence of vibration frequency on amplitude is much 
more difficult to determine by this method. To obtain the dependence 
of internal friction on temperature, the specimen is placed inside a 
vertical tubular furnace, preferably noninductively wound. 


High Frequency Methods 

These methods are usually characterized by the absence of an auxiliary 
inertia member; the specimen, in the form of a rod, is usually vibrated 
in one of its normal modes. Both free decay and steady-state forced 
vibrations may be studied. The frequency range varies depending on 
the type of vibration produced. For example, in torsional or longitudinal 
modes of vibration of cylindrical specimens, the frequency, which 
depends primarily on the length of specimen, usually ranges from 10* 
to 10* cycles per second. Flexural vibrations of a rod, on the other 
hand, are usually observed in the frequency range 300 to 10,000 cycles
per second, and the frequency depends on both the diameter and length
of the specimen.

There is a wide variety of methods that have been used for the
excitation and detection of vibrations at high frequencies. In most of
them the power source is an electrical oscillator. The methods generally
differ in the manner by which electrical oscillations are converted into
the mechanical vibrations of the specimen. Detection consists of a
means for reconverting mechanical into electrical vibrations with
subsequent amplification. Most investigators have employed parallel
schemes for excitation and detection, but in some cases completely
different methods have been used for these two purposes. The most
common high frequency methods, which are illustrated schematically in
Fig. 5 (taken from SIEGEL), are as follows.

Fig. 5. Schematic diagrams of the apparatus for various high
frequency methods of measurement of internal friction

(a) The electrostatic method treats one end of the specimen as
one plate of a condenser, applying an alternating potential difference
between the specimen and an adjacent fixed plate to excite mechanical
vibration. Detection of the vibration may be achieved by forming a
similar condenser at the other end which is used as a condenser
microphone. Frequency modulation has also been used as a method of
detection. If the specimen is not a conductor, the electrostatic method
may still be used if a metal foil is cemented to the specimen. The
electrostatic method is suitable for longitudinal and flexural vibrations;
it may also be used for torsion if electrodes are attached to the
specimen in an appropriate manner.

(b) The eddy-current method is limited to longitudinal vibration
of specimens that are good conductors. In order to drive the specimen, 
one end of it is surrounded by a coil and placed in the field of a per- 
manent magnet. An alternating current from an oscillator through the 
coil induces eddy currents in the specimen which, in turn, interact 
with the magnetic field to set the specimen into vibration. If a similar 
coil and magnetic field are located at the other end of the specimen, 
the motion of the specimen in this magnetic field induces eddy currents 
in the specimen. These currents, in turn, induce an electromotive force 
in the coil, which may then be amplified. The amplitude of vibration 
produced in the specimen is very small, since the electromechanical 
coupling is weak. The method is, therefore, only suitable for the study 
of very low internal friction, where not too much power is needed to 
drive the specimen. Adequate shielding is required to prevent direct 
coupling between the detector and input coils. The chief advantage of 
the method is the fact that nothing is attached to the specimen which 
might produce internal strains and, therefore, additional spurious 
contributions to the damping. 

(c) The magnetic method requires that pole pieces (discs of
ferromagnetic material) be attached to the specimen unless it is itself
ferromagnetic. A coil, placed close to one of the discs, is connected to
the oscillator and the resulting alternating magnetic field interacts with
this pole piece to drive the specimen. A second coil near the other disc
develops an induced e.m.f. which is amplified and detected. The
positioning of the pole pieces and coils determines the mode of vibration;
the method may be used for longitudinal, flexural, and even for
torsional vibration by appropriate arrangement of the pole pieces. An
additional advantage of the method is that it may be used for materials 
that are not electrical conductors. It is particularly useful for the study 
of large damping, because of the relatively strong electromechanical 
coupling. On the other hand, the possibility that the cementing of pole 
pieces to the specimen produces internal strains which contribute to the 
damping, means that this method is not especially good for cases of 
very small damping. An interesting variation of this method is that of
FÖRSTER, in which the two discs are not directly cemented to the
specimen but attached to it through the supporting wires. The
specimen is then driven by the oscillator through one wire and the vibrations
detected through the other. The method is particularly useful for
measurements at high temperatures, since the supporting wires may be
brought outside the furnace so that all the electrical equipment is at
room temperature.

(d) The piezoelectric method of QUIMBY utilizes a quartz crystal,
to which the specimen is cemented. Application of an alternating
potential difference across the quartz sets the two cylinders into
vibration as a single resonant body, or a “composite oscillator.”
Longitudinal and torsional vibrations are produced, respectively, by
means of an X-cut and a Y-cut quartz crystal. The positioning of the
electrodes for these two cases is shown in Fig. 5d. The equivalent 
electrical circuit of this composite oscillator at the resonant frequency 
depends on the elastic and dissipative properties of both the quartz 
and the specimen, but the contributions due to the quartz crystal may 
be determined separately and taken into account. In particular, the 
equivalent resistance of the circuit, which may be measured in an a.c. 
bridge, gives a composite internal friction from which that of the
quartz alone may be subtracted. The contribution of the adhesive 
between quartz and specimen to the measured damping is small if the 
natural resonant frequency of the specimen is closely matched to that 
of the quartz crystal, so that a node of stress occurs at the junction. 
A range of values of maximum strain amplitude from about 10-* to 
10-* may be obtained. READ has shown how this amplitude is
calculated from measurable quantities. This method has been the most
important one for the measurement of amplitude dependent internal 
friction, because the internal friction at any amplitude may be obtained 
directly from the equivalent resistance at resonance, once the propor- 
tionality constant relating these quantities is known. The specimen 
need not be an electrical conductor in this piezoelectric method, but 
the requirement of close matching in the individual resonant frequencies 
of the quartz and specimen is an important limitation. In addition, it 
had been found that strains at the junction resulting from differences
in thermal expansion of the quartz and specimen made significant
contributions to the damping, and thus defeated attempts to measure
internal friction as a function of temperature by this method. This
difficulty is resolved by the use of a liquid adhesive film; however, the
handling of the composite oscillator becomes more difficult when a 
liquid adhesive is used. 

In all the high frequency methods, the external contribution to the 
damping is limited only to losses in the supports, if the system is 
evacuated. The contribution to the damping resulting from the 
presence of air in the system is usually of the order 10-*; that due to 
the supports can be made considerably lower if the supports are located 
at displacement nodes. Thus, values of φ as low as 2 × 10-* have been
reported in evacuated systems.

The high frequency methods are suitable for the study of either free 
or forced vibrations. If the damping is sufficiently small and frequency
not too high (i.e. not too large a decay in 1 sec), it is often possible
to measure directly the time of free decay to a particular fraction of
the initial amplitude. More generally, electrical circuits may be used
to time the reduction of amplitude by any predetermined factor. In
contrast to the damping in free decay, which becomes easier to measure
for low damping, the resonance curve in forced vibrations becomes
more difficult to measure as damping gets very small, because of the
sharpness of resonance. Electrical oscillators can be constructed of
such stability and with sufficiently fine frequency controls, however,
that a resonance curve for which Q⁻¹ (equation (23)) is 10-> may be
measured without difficulty. It is only necessary that one be able to
read frequency variations of better than one part in 10* to obtain such a
resonance curve; the frequency itself need only be known to about
1 per cent.
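The two ways of reducing high frequency data mentioned above can be sketched as follows. The code assumes the standard small-damping relations Q⁻¹ = δ/π for free decay and Q⁻¹ = Δf/f for the width of the resonance curve between the points at which the amplitude is 1/√2 of its maximum. These relations, the function names and the numerical values are assumptions made for illustration and are not taken from equation (23).

```python
import math

def q_inv_from_decay(freq_hz, decay_time_s, factor):
    """Q^-1 from the time taken for the free amplitude to fall by `factor`."""
    cycles = freq_hz * decay_time_s
    delta = math.log(factor) / cycles      # logarithmic decrement per cycle
    return delta / math.pi

def q_inv_from_width(f_res, f_low, f_high):
    """Q^-1 from the frequencies at which the amplitude is 1/sqrt(2) of maximum."""
    return (f_high - f_low) / f_res

def frequency_step(q_inv, points_across_peak=10):
    """Fractional frequency step needed to place several points across the peak."""
    return q_inv / points_across_peak

# Hypothetical example: a 1000 c/s specimen whose amplitude halves in 22 sec.
q_decay = q_inv_from_decay(1000.0, 22.0, 2.0)
# Hypothetical resonance curve 0.01 c/s wide at 1000 c/s.
q_width = q_inv_from_width(1000.0, 999.995, 1000.005)
print(q_decay, q_width, frequency_step(q_width))
```

The last function makes the point of the paragraph explicit: the required relative frequency resolution is a fraction of Q⁻¹ itself, whereas the absolute frequency enters only as a ratio and need be known far less accurately.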

A method that utilizes frequencies in the megacycle range is the 
pulse technique, by which a short ultrasonic pulse is produced at one 
end of the specimen and its time of transit through the specimen and 
attenuation are measured. Although this method has been used
successfully for the measurement of elastic constants, the measured
attenuation may be, in large part, due to scattering by imperfections
rather than to absorption, because of the short wave lengths employed.
This method, therefore, has not as yet been used for quantitative
measurements of internal friction.


STRESS-INDUCED ORDERING: A SOURCE OF INTERNAL FRICTION


A discussion of order concerns itself with the distribution of different 
atomic species in a solid solution on various lattice sites. If there is any 
deviation from a completely random distribution in the manner in 
which lattice sites are occupied, the alloy is said to show a degree of 
order. The concepts of long range and short range order that have been 
so widely discussed are special cases of this definition. The
application of stress to a system may result in a change in the state of order,
if one already exists, or it may produce a state of order in an initially 
disordered solid solution. For stress to have such an effect, it is simply 
required that a particular change in the state of order result in the 
production of strain. This requirement is simply a consequence of the 
thermodynamic reciprocity relations. For example, in the study of 
piezoelectricity, it is well known that if the application of stress
produces electrical charges on particular surfaces of a crystal, then a 
voltage across these surfaces will produce strain (electrostriction). It 
will be of value to derive the reciprocity relation that applies to the 
change of order induced by stress. Let p represent a parameter that is a 
measure of the state of order. Deviation from randomness means the 
existence of an energy term that favours ordering. Let u represent this
ordering energy: the energy decrease per unit volume, per unit increase
in p at constant strain (defined so that u > 0), or

$$u = -\left(\frac{\partial U}{\partial p}\right)_{\varepsilon} \tag{50}$$


where U is the total energy of the system per unit volume. If we anti- 
cipate that u is not appreciably temperature dependent, we may esti- 
mate its value by considerations at absolute zero, where 

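$$dU = \sigma\,d\varepsilon - u\,dp$$

The differential of U − σε is, therefore, −ε dσ − u dp, and since it is a
perfect differential, the reciprocity relation

$$\left(\frac{\partial u}{\partial\sigma}\right)_{p} = \left(\frac{\partial\varepsilon}{\partial p}\right)_{\sigma} \tag{51}$$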


is valid. The meaning of this relation is that if a change of order at 
constant stress produces strain, the application of a stress will increase 
the ordering energy and will, therefore, shift the equilibrium in the 
direction of a greater degree of order. 
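The step from the energy differential to the reciprocity relation may be written out explicitly. The block below is only a restatement of the argument in the text, using the symbols already defined (U, σ, ε, u, p); the final line, which integrates the relation at fixed p, additionally assumes that ∂ε/∂p is a constant, here called λ.

```latex
\begin{aligned}
d(U - \sigma\varepsilon) &= dU - \sigma\,d\varepsilon - \varepsilon\,d\sigma
                          = -\varepsilon\,d\sigma - u\,dp , \\[4pt]
\frac{\partial}{\partial p}\bigl(-\varepsilon\bigr)_{\sigma}
  &= \frac{\partial}{\partial\sigma}\bigl(-u\bigr)_{p}
  \quad\Longrightarrow\quad
  \left(\frac{\partial u}{\partial\sigma}\right)_{p}
  = \left(\frac{\partial\varepsilon}{\partial p}\right)_{\sigma} , \\[4pt]
u(\sigma, p) - u(0, p) &= \lambda\,\sigma
  \qquad \text{if } \left(\frac{\partial\varepsilon}{\partial p}\right)_{\sigma} = \lambda = \text{const.}
\end{aligned}
```

The second line is simply the equality of the mixed second derivatives of the function U − σε with respect to σ and p.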

In this section three examples of such stress-induced ordering will be 
discussed. In the first two cases the state of the material for zero 
stress is one of randomness or disorder; in the third example, we will 
consider initially ordered materials. In all cases, it is expected that the 
behaviour of the material under stress will correspond very closely to 
that of the standard linear solid, since the three conditions that define 
such a solid are valid to a good approximation. First, it is reasonable to 
assume that the non-elastic strain that accompanies a small change in 
p is proportional to this change. Equation (34) is, therefore, valid if 
p is redefined as the difference between the state of order existing and 
that at zero stress, so as to obtain a direct proportionality between 
ε″ and p instead of a linear relationship. Furthermore, the change in
the equilibrium state of order under stress is proportional to the change 
of ordering energy, u, which is, in turn, proportional to the stress (again 
assuming small changes). Finally, the rate of approach to equilibrium 
is determined by the difference between atomic jumps that lead to 
greater and to lesser degrees of order per unit time, which is, in turn,
proportional to the deviation of p from its equilibrium value. (For p
equal to its equilibrium value $\bar{p}$, the rate of occurrence of atomic jumps
that act to increase p and those that act to decrease p must be equal.)
Equation (35) is, therefore, valid. The mechanical properties of a 
material in which stress-induced ordering occurs will, therefore, be 
those of a standard linear solid. In particular, measurement of internal 
friction as a function of vibration frequency will show the characteristic 
peak whose equation is given by (39). The time of relaxation τ
associated with stress-induced ordering must bear a simple relation to
the mean frequency of atomic jumps, since the mechanism of the
change of the state of order must be the same as that for volume
diffusion. It follows that

$$\tau^{-1} = \alpha\Gamma \tag{52}$$


where Γ is the appropriate atom jump frequency and α a dimensionless
constant. Inasmuch as a change in the state of order of a material can
be accomplished by a small number of atomic jumps, the magnitude of α
is of the order of unity. It therefore appears that by simply locating
an internal friction peak, which determines τ according to the relation
ωτ = 1, it is possible to measure atomic jump frequencies in appropriate
alloys.
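The reduction from a measured peak to a jump frequency is a one-line calculation, sketched below. The peak shape used is the usual Debye form for a standard linear solid with small relaxation strength, φ = Δ ωτ/(1 + ω²τ²), which is assumed here to correspond to equation (39); the function names and the numerical values are hypothetical.

```python
import math

def debye_peak(omega, tau, delta):
    """Internal friction of a standard linear solid, assuming delta << 1."""
    x = omega * tau
    return delta * x / (1.0 + x * x)

def jump_frequency_from_peak(freq_hz, alpha=1.0):
    """Jump frequency from the vibration frequency at which the peak occurs.

    Uses the peak condition omega * tau = 1 together with
    equation (52), 1/tau = alpha * Gamma.
    """
    omega = 2.0 * math.pi * freq_hz
    tau = 1.0 / omega
    return 1.0 / (alpha * tau)

# Hypothetical example: a peak found at 1 vibration per second, alpha = 3/2.
gamma = jump_frequency_from_peak(1.0, alpha=1.5)
peak_height = debye_peak(2.0 * math.pi, 1.0 / (2.0 * math.pi), delta=0.01)
print(gamma, peak_height)
```

Because α is only of the order of unity, the uncertainty in α carries over directly to the jump frequency; the value 3/2 used in the example is the one obtained later in the text for interstitials in a b.c.c. lattice.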

The application of internal friction measurements to the study of 
diffusion phenomena is based on equation (52). Let us contrast the 
internal friction methods to conventional diffusion measurements. In 
the latter case, there must be a concentration gradient initially present 
in the specimen, and one obtains the change in the
concentration-versus-distance curves as a function of time. From these curves a
diffusion coefficient may be obtained, which is related to the appropriate atomic
jump frequency by the equation

$$D = \beta a^{2}\Gamma \tag{53}$$


where a is the lattice parameter, and β a dimensionless constant which
depends on the geometry of the lattice structure and is of the order of
magnitude unity. Since diffusion is a random walk problem, the mean
distance that an atom travels from its starting position is proportional
to t^(1/2) (t = time) and, therefore, to the square root of the number of
atomic jumps. Thus, in order that diffusion takes place, even over
distances of the order of 0.1 mm, a mean number of atom jumps of
about 10 to 10" is required. If such a large number of jumps is to
occur within the time limit of an experiment, the jump frequency must
be very great. The magnitude of Γ and, therefore, of D can be controlled
by selecting the appropriate temperature, since both quantities obey
an Arrhenius equation,

$$D = D_0\,e^{-H/RT}, \qquad \Gamma = \Gamma_0\,e^{-H/RT} \tag{54}$$


where H is called the heat of activation for diffusion. Conventional
diffusion measurements are, therefore, limited to high temperatures.
For example, the temperature at which the jump frequency is 1 sec⁻¹
is far too low for diffusion measurements, since the experiment requires
a time of at least 10¹⁰ sec or 300 years! On the other hand, such a jump
frequency can be measured very conveniently by internal friction
methods when the period of vibration is of the order of 1 sec. The
restriction of conventional diffusion measurements to high temperatures
has led to the following limitations: (1) only a small range of values of 
D can be measured (usually D can be obtained only over two to three 
powers of ten); (2) when diffusion is due to the movement of vacancies, 
it is not possible to learn more about the process by trapping non- 
equilibrium numbers of vacancies at low temperatures and detecting 
their presence by the abnormally rapid atomic movement that they 
produce; (3) it is not possible to study the effect of dislocations (cold 
work) on diffusion because experiments are conducted above the 
recrystallization temperature; (4) precipitation cannot be studied by 
direct measurement of the rate of atomic movement in supersaturated 
solid solutions. The use of internal friction methods for the study of
diffusion phenomena has already contributed towards getting these
investigations out of the strait jacket of high temperature measurements.
In those cases in which internal friction methods are applicable, con- 
tributions have been made toward the elimination of each of the 
limitations listed above. 
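The contrast between the two kinds of measurement rests on a simple order-of-magnitude estimate, which may be sketched as follows. The code combines the random walk relation x² ~ Dt with D = βa²Γ, so that the number of jumps needed is Γt ~ x²/(βa²), and inverts the Arrhenius equation to find the temperature at which a given jump frequency is reached. The lattice parameter, frequency factor and heat of activation are hypothetical round numbers, not data from the text.

```python
import math

R = 8.314  # gas constant, J/(mol K)

def jumps_needed(distance_m, lattice_param_m, beta=1.0):
    """Mean number of atomic jumps for a random-walk displacement `distance_m`."""
    return distance_m ** 2 / (beta * lattice_param_m ** 2)

def arrhenius(prefactor, H, T):
    """Rate (or diffusion coefficient) at absolute temperature T."""
    return prefactor * math.exp(-H / (R * T))

def temperature_for_rate(rate, prefactor, H):
    """Temperature at which the Arrhenius expression equals `rate`."""
    return H / (R * math.log(prefactor / rate))

# Hypothetical values: a = 3e-10 m, diffusion distance 0.1 mm.
n_jumps = jumps_needed(1.0e-4, 3.0e-10)
time_at_one_per_sec = n_jumps / 1.0        # seconds, if Gamma = 1 per second

# Hypothetical Arrhenius parameters: Gamma_0 = 1e13 per second, H = 80 kJ/mol.
T_internal_friction = temperature_for_rate(1.0, 1.0e13, 8.0e4)   # Gamma = 1 /s
T_diffusion = temperature_for_rate(1.0e6, 1.0e13, 8.0e4)         # Gamma = 1e6 /s
print(n_jumps, time_at_one_per_sec, T_internal_friction, T_diffusion)
```

With these assumed numbers the jump count for 0.1 mm comes out of the order of 10¹¹, and the temperature needed for a conveniently fast diffusion anneal lies well above that at which an internal friction peak would be observed at about one vibration per second.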


Examples of Interstitial Solutions in B.C.C. Lattices 


The study of stress-induced ordering by internal friction methods has 
been most extensive in the case of interstitial solid solutions in b.c.c. 
(body-centred cubic) lattices. Most investigations have been concerned 
with carbon and nitrogen as interstitials in α-iron; in fact, the internal
friction effects of this general type were first discovered and explained
for the case of α-iron. In early work it was observed that the damping
of steel tuning forks goes through a maximum as a function of
temperature and that the location of this maximum is dependent upon the
frequency of the fork. SNOEK showed that interstitial solutes were
essential to this damping peak by demonstrating that the effect no
longer occurred when all traces of carbon and nitrogen were removed.
He also first explained the origin of the effect as follows: interstitial
carbon and nitrogen in b.c.c. iron are not located in the largest spaces,
but at the centres of the cell edges, i.e. positions of the type (½, 0, 0),
and at the face centres, which are equivalent positions. An interstitial
atom located in one of these positions will produce a distortion that has
tetragonal symmetry, i.e. most of the distortion is in the direction of the
two closest iron atoms (one of the ⟨100⟩ directions). If the directions
of the cube axes are designated as x, y and z, respectively, the
interstitial sites may be designated as x, y or z sites according to the direction
in which maximum distortion is produced. Under zero stress, the three
kinds of interstitial sites are occupied in a random fashion, in accordance
with the concentration of interstitial atoms; each type of site,
therefore, has one-third of the total number of interstitials. Since an excess
of atoms in any one type of site would result in an elongation in the
corresponding direction, our reciprocity equation (51) requires that the
application of a tensile stress in the z direction, for example, will lower
the energy for atoms in z sites as against x and y sites. Such a tensile
stress will, therefore, produce a redistribution of atoms such that the
z sites are occupied preferentially. A state of order is thus induced by
stress where a completely random distribution existed for zero stress.

The atomic description given by SNOEK to explain the internal
friction peak observed in α-iron is so complete that a calculation of the
anelastic parameters $\Delta_M$ and τ in terms of physical constants is readily
carried out in terms of this description. It is convenient to define the
parameter p as

$$p = n_z - n/3 \tag{55}$$

where n is the number of interstitials and $n_z$ the number in z sites, per
unit volume. The non-elastic strain in the z direction is given by
equation (34), and its equilibrium value at any stress, by equation (36),
where M is now replaced by Young’s modulus in the ⟨100⟩ direction,
which we will call E. The quantity u, equation (50), is now the decrease
in energy when one interstitial is brought from an x or y site to a z site.
When equations (34) and (36) are combined, we obtain

$$\Delta_E = \lambda E\,\bar{p}/\sigma \tag{56}$$


In order to complete the derivation of $\Delta_E$ and to obtain the relaxation
time τ, it is necessary to calculate the net rate of change of $n_z$. This
quantity is given by

$$\dot{n}_z = -(\Gamma_{zx} + \Gamma_{zy})\,n_z + \Gamma_{xz}\,n_x + \Gamma_{yz}\,n_y \tag{57}$$

where $n_x$ and $n_y$ are defined analogously to $n_z$, $\Gamma_{yz}$ is the rate (or
probability per second) of transitions from y sites to z sites, and similar
definitions apply to $\Gamma_{xz}$, $\Gamma_{zx}$ and $\Gamma_{zy}$. A consideration of the geometry
of the lattice shows that, if all jumps of interstitials occur between sites
that are nearest neighbours, then an atom in a z site may move only into
four positions of which two are x sites and two y sites (similarly for
other transitions). Furthermore, the activation energy is no longer the
same for jumps into and out of z positions. Fig. 6 shows that if the
height of the barrier under zero stress is H, in accordance with equation
(54), then the application of stress must raise the barrier for z → x
transitions by u/2, and lower the barrier for x → z transitions by this
same amount; the energy difference between the x and z states is u,
as already mentioned. These arguments lead to the conclusion that

$$\Gamma_{xz} = \Gamma_{yz} = \tfrac{1}{2}\Gamma^{0}\,(1 + u/2kT)$$

and

$$\Gamma_{zx} = \Gamma_{zy} = \tfrac{1}{2}\Gamma^{0}\,(1 - u/2kT)$$

where $\Gamma^{0}$ is the rate of jump of interstitials under zero stress, and
u/kT ≪ 1 because the applied stress is assumed small. These terms may
now be inserted into equation (57) to obtain

$$\dot{n}_z = -\tfrac{3}{2}\Gamma^{0}\left[(n_z - n/3) - \frac{2}{9}\,\frac{un}{kT}\right] \tag{58}$$

where $n_x + n_y$ has been set equal to $n - n_z$, and the term $u n_z$ is
approximately equal to un/3, since u is already small, of first order. Comparison
of equation (58) with equations (35) and (55) shows that

$$\tau^{-1} = \tfrac{3}{2}\Gamma^{0} \tag{59}$$

(or, in terms of equation (52), α = 3/2), and

$$\bar{p} = \frac{2}{9}\,\frac{un}{kT} \tag{60}$$

Fig. 6. The effect of a tensile stress in the z direction on the energy
barrier for the jump of interstitial atoms between x and z sites
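The relaxation time and equilibrium order quoted above can be checked by integrating the rate equation directly. The sketch below is only an illustration: it assumes that each interstitial leaves its site at a total rate Γ⁰ under zero stress, shared equally between the two other kinds of site, so that each of the four transition rates is ½Γ⁰(1 ± u/2kT), and it uses a simple forward Euler step. The function name and the numerical values are hypothetical.

```python
import math

def simulate_nz(gamma0, u_over_kT, n=1.0, dt=1e-3, t_end=10.0):
    """Integrate the rate equation for n_z after a stress is applied at t = 0.

    Returns a list of (time, p) pairs with p = n_z - n/3.
    """
    g_in = 0.5 * gamma0 * (1.0 + 0.5 * u_over_kT)    # x -> z and y -> z
    g_out = 0.5 * gamma0 * (1.0 - 0.5 * u_over_kT)   # z -> x and z -> y
    nz = n / 3.0                                      # random start, zero stress
    history = []
    steps = int(round(t_end / dt))
    for k in range(steps + 1):
        history.append((k * dt, nz - n / 3.0))
        # n_x + n_y = n - n_z, and both inward rates are equal.
        rate = -2.0 * g_out * nz + g_in * (n - nz)
        nz += dt * rate                               # forward Euler step
    return history

# Hypothetical values: Gamma0 = 1 per second, u/kT = 0.01, n = 1.
gamma0, x = 1.0, 0.01
trace = simulate_nz(gamma0, x)
p_final = trace[-1][1]
p_predicted = (2.0 / 9.0) * x            # equation (60) with n = 1
tau_predicted = 1.0 / (1.5 * gamma0)     # equation (59)
k_tau = int(round(tau_predicted / 1e-3))
fraction_at_tau = trace[k_tau][1] / p_final
print(p_final, p_predicted, fraction_at_tau, 1.0 - math.exp(-1.0))
```

The small residual differences between the simulated and predicted values are of relative order u/kT, which is the order of the terms neglected in passing from (57) to (58).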


If u is assumed to depend only on σ and not on p (this assumption
means that mutual interaction between interstitials is not important,
and should certainly be valid for the low concentration of interstitial
solutes usually encountered) then, from the reciprocity equation (51),
u = λσ, where λ is defined by equation (34). When this result is
substituted into equation (60) and the resulting expression for $\bar{p}$ inserted
into (56), we obtain the final expression for $\Delta_E$

$$\Delta_E = \frac{2}{9}\,\frac{\lambda^{2}E}{k}\,\frac{n}{T} \tag{61}$$

The relaxation strength $\Delta_E$ is, therefore, proportional to the solute
concentration and varies inversely as the absolute temperature. To
calculate $\Delta_E$ requires only a knowledge of λ = (∂ε/∂p), which can be
estimated for the important case of carbon in α-iron from the change
in lattice parameters of tetragonal martensite as a function of carbon
concentration. The constant that relates $\Delta_E$ to n/T can also be
determined experimentally from the height of the internal friction peak for a
known concentration of solute. The value of this constant is, however,
dependent on the crystallographic direction of application of the tensile
stress. The above derivation applies to a stress in the ⟨100⟩ direction;
if the stress were in the ⟨111⟩ direction, SNOEK’s theory predicts
that $\Delta_E = 0$, since all three tetragonal axes are then equally inclined
to the direction of application of stress. The verification of this
predicted result by DIJKSTRA, who found, in single crystals of α-iron,
that $\Delta_E$ for the ⟨111⟩ direction is negligible compared to its value in
the ⟨100⟩ direction, is the most striking evidence of the correctness of
SNOEK’s picture of the relaxation process. The value of the constant
term in the expression (61) for $\Delta_E$ has been obtained experimentally,
for carbon and nitrogen in polycrystalline α-iron, by DIJKSTRA; it was
found that, for internal friction peaks close to room temperature, the
maximum damping (which is equal to $\Delta_E$/2) is very closely equal to the
weight per cent of interstitial solute. Because of the strong orientation
dependence of $\Delta_E$, already discussed, it should be remembered that any
texture in the specimens used will lead to errors in the constant that
relates the height of the peak to the solute concentration. On the other
hand, relative values of the height of the peak, which may be studied as
a function of heat treatment, give a precise measure of the relative
residual concentration of interstitial atoms in solution at any time.
This is the basis for the study of precipitation which will be described
below.
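The orientation argument can be made concrete with a small geometrical sketch. It assumes, as a simplification not stated in the text, that the coupling of each kind of site to a uniaxial stress varies as the squared direction cosine between the stress axis and the tetragonal axis of that site. Ordering is then driven only by the differences between the three couplings. The function names are hypothetical, and the factor computed refers to the relaxation of the compliance under this assumption, not to the relaxation strength itself, since Young's modulus also depends on orientation.

```python
import math

def site_couplings(direction):
    """Coupling of x, y and z sites to a uniaxial stress, relative to the mean.

    Each entry is cos^2(angle between stress axis and site axis) - 1/3.
    Equal entries (all zero) mean the stress cannot favour any kind of site.
    """
    norm = math.sqrt(sum(c * c for c in direction))
    return [(c / norm) ** 2 - 1.0 / 3.0 for c in direction]

def orientation_factor(direction):
    """Sum of squared relative couplings, normalized to 1 for <100>."""
    s = site_couplings(direction)
    return sum(v * v for v in s) / (2.0 / 3.0)

for axis in ([1, 0, 0], [1, 1, 0], [1, 1, 1]):
    print(axis, site_couplings(axis), orientation_factor(axis))
```

For a stress along ⟨111⟩ the three couplings are equal and the factor vanishes, in line with the prediction quoted above; for ⟨100⟩ the z-type site is singled out and the factor takes its largest value.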

Although the height of the internal friction peak is sensitive to orienta- 
tion, the temperature of maximum internal friction for a given fre- 
quency, i.e. the location of the peak, is independent of both orientation 
and the concentration of solute, but is a measure only of the jump rate 
of interstitials at that temperature (equation (59)). 
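Since the peak condition ωτ = 1 ties the peak temperature to the jump rate, the shift of the peak with frequency yields the heat of activation. The sketch below assumes that the jump rate follows the Arrhenius equation with a temperature-independent frequency factor, so that ln(f₂/f₁) = (H/R)(1/T₁ − 1/T₂); the function names and the numerical values are hypothetical.

```python
import math

R = 8.314  # gas constant, J/(mol K)

def activation_energy(f1_hz, T1_K, f2_hz, T2_K):
    """Heat of activation from the peak temperatures at two frequencies."""
    return R * math.log(f2_hz / f1_hz) / (1.0 / T1_K - 1.0 / T2_K)

def peak_temperature(f_hz, f_ref_hz, T_ref_K, H):
    """Peak temperature at f_hz, given the peak at (f_ref_hz, T_ref_K)."""
    inv_T = 1.0 / T_ref_K - R * math.log(f_hz / f_ref_hz) / H
    return 1.0 / inv_T

# Hypothetical example: peak at 300 K for 1 c/s, H = 80 kJ/mol.
T_high = peak_temperature(1000.0, 1.0, 300.0, 8.0e4)
H_back = activation_energy(1.0, 300.0, 1000.0, T_high)
print(T_high, H_back)
```

Raising the frequency moves the peak to a higher temperature, and a thousandfold change in frequency gives a shift large enough to be measured comfortably, which is why the combination of torsion pendulum and high frequency methods is convenient for determining H.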

The generality of the SNOEK mechanism has been demonstrated by
the observation of effects, in other b.c.c. lattices containing interstitial
solutes, similar to those in α-iron. Thus KÊ investigated relaxation
effects due to carbon and oxygen, and also to nitrogen, in tantalum.
In the case of oxygen the observed internal friction peak is considerably
broader than that corresponding to a single time of relaxation. It was,
therefore, suggested that oxygen interstitials in tantalum may be
located in two different kinds of positions, so that two types of jumps are
possible. Further investigation of carbon in tantalum and also in
columbium has been carried out by WERT. In the nonmetallic
category, internal friction peaks observed in glass have been attributed
to the diffusion of sodium and potassium ions to preferential positions
in the glass structure, in very much the same way as in the case of
metals.

The internal friction peak originating in the stress-induced ordering
of interstitial solutes has been utilized for extensive investigations of
precipitation from supersaturated solutions, especially in the case of
α-iron. It has already been shown that $\Delta_E$ is proportional to n, the
concentration of interstitials in solution. This is not the same as the 
total concentration of solute present. The solute is usually introduced 
into the lattice by a high temperature treatment in an appropriate 
chemical atmosphere. At this high temperature, the solubility is usually 
considerably greater than at lower temperatures, so that if equilibrium 
is to be maintained, precipitation of a second phase rich in solute must 
take place at lower temperatures. The solute thus precipitated is 
usually tightly bound, in the form of an intermetallic compound, and 
cannot contribute to the internal friction. The height of the peak is, 
therefore, a measure only of the concentration of interstitial atoms in 
solid solution, and decreases as precipitation takes place. This suggests 
a unique method for following the precipitation, since the actual amount 
of precipitate can be obtained quantitatively as a function of ageing 
time and temperature. No other physical property that has been used 
to follow the course of precipitation (electrical resistivity, thermoelectric 
effect, magnetic properties, density, and microscopic investigation) is 
determined so completely by the quantity of precipitate, and is indepen- 
dent of the condition or size of precipitate particles. This method was 
first applied by DIJKSTRA to the study of the precipitation of nitrogen 
and carbon in α-iron. The internal friction peak for nitrogen is at 20°C, 
and for carbon at 36°C, when the vibration frequency is about 1 sec⁻¹. 
The rate of precipitation is negligible at these low temperatures, so that 
a specimen is first quenched from the temperature at which carbon or 
nitrogen is introduced and then tempered for various periods at the 
desired precipitation temperature. After each heat treatment the 
specimen (in wire form) is quenched to prevent further precipitation 
and the height of the peak determined in a torsion pendulum. In this 
way, one can obtain the amount of solute left in solution, and therefore the 
amount precipitated, as a function of tempering time at a given tem- 
perature. To obtain a similar tempering curve at a second temperature, 
it is simply necessary to give the original specimen a re-solution heat 
treatment and quench. Using this method, DIJKSTRA finds that the 
precipitation of nitrogen in α-iron occurs in two successive stages. In 
the first, plates of an unknown precipitate (called "N phase 1") form 
on (100) planes of the matrix (determined by microscopic examination) 
and in the second, Fe₄N plates form. It is also found that precipitation 
is much faster in cold worked α-iron and that the same type of precipi- 
tate is no longer formed. This study of precipitation in iron was 
continued by WERT and a quantitative interpretation given to the 
measurements by ZENER. It is shown that the amount of precipitate 
q would, in the initial stages of the process, follow the law 

$$ q = b\,(t - t_0)^{r} \qquad (62) $$

where t₀ is the time for nucleation (time at which the nucleus is formed, 
so that growth occurs thereafter by direct diffusion of solute atoms to 
the nucleus). ZENER also shows that the value of r depends on the 
shape of the precipitate particles; for spherical particles r = 3/2, for 
cylindrical particles r = 2, and for circular discs r = 5/2. The results 
for precipitation of carbon in α-iron give r = 3/2, and for the first 
stage (N phase 1) of nitrogen precipitation, r = 5/2; the shape of 
particles in these two cases is, therefore, determined. In all cases t₀ is 
found to be zero within experimental error. Apparently nuclei are 
present as soon as the material is quenched or, if there is an incubation 
time, it is very short compared to the time for precipitation. 
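
The way an exponent r is read off a tempering curve can be sketched numerically. The following is only an illustrative sketch: it assumes t₀ = 0 (as found experimentally above), fits equation (62) as a straight line on a log-log plot, and compares the slope with the three values quoted from ZENER. The data points, the constant 1.0e-4, and the function names are hypothetical and are not taken from any of the cited measurements.

```python
import math

# Exponents quoted in the text for equation (62), q = b (t - t0)^r.
ZENER_EXPONENTS = {"sphere": 1.5, "cylinder": 2.0, "disc": 2.5}


def fit_power_law(times, amounts):
    """Least-squares fit of log q = log b + r log t (t0 taken as zero)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(q) for q in amounts]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    r = sxy / sxx
    b = math.exp(y_mean - r * x_mean)
    return b, r


def closest_shape(r):
    """Particle shape whose quoted exponent lies nearest to the fitted r."""
    return min(ZENER_EXPONENTS, key=lambda s: abs(ZENER_EXPONENTS[s] - r))


# Hypothetical early-stage data generated from a 3/2 power law.
times = [10.0, 20.0, 40.0, 80.0, 160.0]
amounts = [1.0e-4 * t ** 1.5 for t in times]

b, r = fit_power_law(times, amounts)
print(f"fitted r = {r:.3f}, b = {b:.3e}, nearest shape: {closest_shape(r)}")
```

Only the early part of a real curve should be used in such a fit, since the power law is stated to hold in the initial stages of the process.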

The experimental curves deviate from the simple power law of 
equation (62) in later stages of the precipitation process, where com- 
petition between adjacent particles for the solute atoms in a given region 
of solution becomes appreciable. More exact laws are derived to take 
these interference effects into account. ZENER also shows how the 
number of precipitate particles per unit volume, and, therefore, the 
mean spacing of particles, can be obtained from experimental values 
of the constant b of equation (62). 

As a part of the precipitation experiments, it is possible to determine 
the concentration of solute remaining in solution at any given tempera- 
ture after precipitation is complete, i.e. the solubility curve for the 
particular interstitial solute. Smaller solubilities can be determined by 
the internal friction method than those that one can usually determine 
by chemical methods. The solubility line for carbon in α-iron was 
determined down to 150°C, where a solubility of only 10~* weight per 
cent was observed. This value is lower than all previous estimates and 
more directly obtained. 

The use of the height of the internal friction peak in interstitial 
solutions to measure the degree of precipitation is extended to the study 
of precipitation of carbon and nitrogen in cold worked α-iron by 
HARPER. For a sufficiently large amount of cold work (5 per cent 
extension is sufficient) normal precipitation, as studied by WERT, does 
not occur. Instead, an analysis by COTTRELL and BILBY suggests 
that precipitation of solute atoms into edge dislocations takes place. 
These authors calculate that such precipitation will follow a law of the 
form of equation (62) with r = 2/3. HARPER verified this power law 
experimentally, and with the aid of the formula of COTTRELL and 
BILBY, obtained the density of dislocations in the specimens for 
different degrees of cold work. Similar results were obtained for both 
carbon and nitrogen in α-iron. It is striking that these two cases, for 
which normal precipitation occurs according to equation (62) with two 
different r values, follow a 2/3 power law when precipitation occurs in 
a cold worked lattice. 

The final illustration of the versatility of SNOEK's internal friction 
peak is its application to the study of diffusion. The potentialities of 
internal friction methods for such investigations have already been 
discussed in connexion with equation (52). In the case of interstitial 
solutes, we have derived equation (59), which enables us to determine 
atomic jump rates directly, and quantitatively, from measured relaxa- 
tion times. The combination of low temperature internal friction 
measurements with high temperature conventional diffusion measure- 
ments makes possible the determination of D and Γ over a very wide 
range; H and D₀, or Γ₀ (equation (54)), may then be determined with 
great precision. It would seem that the atomic jump rate Γ is the most 
fundamental quantity with which we are here concerned, and that both 
the measured diffusion coefficients and relaxation times would be con- 
verted into the appropriate values of Γ by means of equations (53) and 
(59), respectively. The custom of reporting diffusion coefficients has 
been so widespread, however, that more often τ-values are converted 
into diffusion coefficients and compared directly with high temperature 
diffusion measurements. This focus of attention on D rather than Γ 
does not lead to any complications in interpretation, because of the 
simple proportionality between the two quantities. SNOEK reported 
values of D₀ and H for diffusion of carbon and nitrogen in α-iron, based 
on his measurements. More precise internal friction data were later 
reported by WERT and ZENER. The most complete data for carbon 
in α-iron are obtained by the combination of static relaxation measure- 
ments with internal friction and conventional diffusion measurements, 
which give D₀ = 0.02 cm²/sec and H = 20,100 cal/mole, from a straight 
line plot of log D versus 1/T that covers fourteen powers of ten in D 
over the temperature range −35°C to 800°C. It has been shown that 
values of D₀ may be predicted theoretically if it is assumed that the free 
energy of activation, or the work required to bring an interstitial atom 
to the midpoint between two interstitial sites, goes entirely into elastic 
straining of the surrounding lattice. The value of D₀ calculated from 
this theory for the case of carbon in iron is in excellent agreement with 
the experimental result reported above. Earlier reported values of D₀ 
had not fitted the theory so well, because of the difficulty in a precise 
determination of D₀ when the data for D are only over a limited tem- 
perature range. (D₀ is obtained from the intercept at 1/T = 0 of the 
straight line plot of log D versus 1/T. Low temperature data, as 
obtainable by internal friction methods, are, therefore, very valuable in 
making possible a precise extrapolation of the straight line.) In other 
systems, particularly nitrogen in iron and carbon in tantalum, the 
agreement of reported values of D₀ with the theory has improved 
as the range of measurement was extended. This verification of the 
strain theory of D₀, wherever precise data for interstitial diffusion are 
available, suggests that, in other interstitial solutions, the heat of 
activation can be obtained from only a single measurement of the 
diffusion coefficient D by the substitution of the calculated D₀ into 
equation (54). 
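
The figures quoted above for carbon in α-iron can be checked with a few lines of arithmetic. The sketch below assumes that equation (54), which is not reproduced in this part of the text, has the usual form D = D₀ exp(−H/RT); the function names are illustrative only. It evaluates D at the two ends of the stated temperature range and also shows how H would follow from a single measured D once D₀ is known, as suggested in the closing sentence.

```python
import math

R_CAL = 1.987   # gas constant in cal/(mole K)
D0 = 0.02       # cm^2/sec, value quoted in the text for carbon in alpha-iron
H = 20100.0     # cal/mole, value quoted in the text


def diffusion_coefficient(temp_celsius, d0=D0, h=H):
    """D = D0 exp(-H/RT); assumed form of equation (54)."""
    t_kelvin = temp_celsius + 273.15
    return d0 * math.exp(-h / (R_CAL * t_kelvin))


def activation_heat_from_single_d(d_measured, temp_celsius, d0):
    """Invert the assumed equation (54) for H, given one D and a known D0."""
    t_kelvin = temp_celsius + 273.15
    return R_CAL * t_kelvin * math.log(d0 / d_measured)


d_low = diffusion_coefficient(-35.0)
d_high = diffusion_coefficient(800.0)
decades = math.log10(d_high / d_low)
print(f"D(-35 C) = {d_low:.2e} cm^2/sec, D(800 C) = {d_high:.2e} cm^2/sec")
print(f"range covered: {decades:.1f} powers of ten")
```

With the quoted D₀ and H the computed span comes out between fourteen and fifteen powers of ten, consistent with the statement in the text.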


Examples of Random Substitutional Solutions 


A substitutional solid solution that shows no order under zero stress 
may also show anelastic effects resulting from stress-induced ordering. 
Such effects have only been observed at relatively high concentrations 
of solute (when the solute is present to ten or more atomic per cent), 
in contrast to the interstitial case where minute amounts of solute 
produce a sizable internal friction peak. The first observation of an 
internal friction peak due to stress-induced ordering in a substitutional 
solution was that of ZENER in the case of a single crystal of α-brass 
(30 per cent Zn). These measurements were only at one frequency and 
composition and the origin of the peak was not explained. Later KÊ 
verified the existence of such a peak in (70-30) α-brass at lower fre- 
quencies, and obtained a heat of activation in reasonable agreement 
with the heat of activation for diffusion of zinc in brass. A possible 
mechanism for this relaxation effect in brass was suggested by ZENER 
and will be discussed below. The investigation of this type of anelas- 
ticity was extended by the writer to α-silver-zinc alloys (silver rich), 
where similar, but larger, effects were anticipated and found (due to the 
greater difference in atomic radii of Ag and Zn as compared to Cu and 
Zn). Extensive work was carried out on the Ag-Zn system and the 
dependence of internal friction on composition and temperature was 
obtained. The observed internal friction peak is clearly a volume effect, 
since it is independent of grain size. However, if the grain size is too 
fine, the grain boundary internal friction (see the next section) also 
occurs, thus requiring a resolution of the two effects. The most success- 
ful investigations of the ordering internal friction are, therefore, carried 
out on single crystals or coarse-grained polycrystalline specimens. 
Clearly, the explanation for the observations in substitutional alloys 
cannot be the same as for interstitial alloys. For a lattice that has 
cubic symmetry, any elastic distortion due to the substitution of a 
larger or smaller atom at one lattice site must also have cubic symmetry. 
There are, therefore, no preferential sites for such atoms when stress is 
applied. On the other hand, the distortion about a pair of solute atoms 
in nearest neighbour positions no longer has cubic symmetry; a pair of 
solute atoms will produce tetragonal distortion about the direction of the 
pair axis. This loss of cubic symmetry in the case of pairs of atoms is 
the basis for the explanation given by ZENER for the anelastic 
behaviour of substitutional alloys. He suggests that pairs of substitu- 
tional solute atoms behave completely in analogy to single interstitials. 
For example, if the solute atom is larger than the solvent, then under 
zero stress, pairs of solutes will be randomly distributed over all 
crystallographically permissible directions. If, now, a tensile stress is 
applied in a particular direction, the pair axes will show a preferential 
alignment in those crystallographic directions which are the nearest to 
the stress direction. The attempt of the lattice to establish a state of 
equilibrium at each value of stress will give rise to internal friction in 
exactly the same way as in the case of interstitial solutions. The time 
of relaxation will be closely related to the rate of atomic movement in 
the alloy. A high enough concentration of solute is required, in the 
substitutional case, for the occurrence of pairs of atoms to be reasonably 
probable. This stress-induced preferential orientation of pairs of atoms 
is a form of stress-induced ordering. The type of order involved does not 
fall into the category of either long range or short range order, for it is 
assumed that the pairs themselves are formed by random fluctuations. 
The phenomenon is one of stress-induced ordering nevertheless, since 
the preference for special directions of pair axes is a deviation from 
randomness. 

Although the description of the relaxation phenomena in substitu- 
tional solutions in terms of reorientation of solute atom pairs has the 
advantage of its direct analogy to the case of interstitial solutions, it 
cannot be taken too seriously for several reasons. The "pair" concept 
is probably most nearly satisfactory at low solute concentrations, where 
most solute atoms are far apart and the number of pairs is quite small. 
Even in such a dilute solution, where the internal friction effects are 
probably too small to be observed, it is not correct to think of a fixed 
number of pairs of solute atoms, since a tendency toward preferential 
creation or dissolution of nearest neighbour pairs may occur under stress. 
In actual measurements, solutions containing as high as one-third solute 
atoms have been used; here clusters of solute atoms considerably more 
complex than pairs occur with great frequency. The behaviour of such 
clusters under stress cannot readily be taken into account in terms of a 
theory of reorientation of solute pairs. In fact, the very concept of 
solute and solvent becomes ambiguous as the concentrations approach 
50 per cent! 

If the pair-reorientation picture is an oversimplification, and, there- 
fore, not an acceptable explanation for the observed behaviour of 
substitutional solutions, can we still be sure that the phenomenon is one 
of stress-induced ordering, controlled by volume diffusion within the 
alloy? There is excellent evidence that leads to an affirmative answer 
to this question. In the first place, these effects are observed in well- 
annealed single crystals and are not structure sensitive. In fact, the 
observations are quantitatively reproducible in different specimens of 
the same alloy. The possibility that the observed phenomena are 
related to the presence of internal interfaces or of dislocations is, 
therefore, discounted. Secondly, these anelastic effects are only 
observed in alloys, and are, apparently, larger the greater the atomic 
size difference of the solute and solvent atoms. Finally, the width of 
the internal friction peak corresponds very closely to a single time of 
relaxation, and the heat of activation is of the correct magnitude for 
volume diffusion. In view of this evidence, it seems almost certain that 
the relaxation is controlled by the movement of atoms in the lattice, 
which acts to produce an ordered condition of the alloy under an applied 
stress. It is, therefore, expected that the measured time of relaxation τ 
and the appropriate jump frequency Γ are related by equation (52). 
The dimensionless constant α cannot be calculated, as it was in the 
interstitial case, until we know more precisely the exact nature of the 
ordering process. A reasonable value for α can be estimated, however. 
In the interpretation of equation (52), it is of great importance to 
specify what is the appropriate jump frequency Γ. The interpretation 
of this quantity is not as obvious in a substitutional alloy as in the 
interstitial case; in a substitutional solution there are two significant 
rates of atom movement, those corresponding to each of the two types of 
atoms, say, A and B. If the mechanism of diffusion is not a direct 
interchange of lattice positions, these two rates, Γ_A and Γ_B, may differ 
significantly. The well-known experiments of KIRKENDALL and of 
DA SILVA and MEHL give evidence that the two atomic species 
actually do not move at the same rate, at least in several f.c.c. alloys. 
It is, therefore, probable that diffusion occurs by means of the move- 
ment of lattice vacancies. In contrast to conventional diffusion 
measurements where the faster moving atoms make the major contri- 
bution to the measured diffusion coefficient, the stress-induced 
ordering process is controlled by the rate of movement of the slower 
atomic species. This conclusion may be demonstrated as follows: 
let us suppose that B atoms move much faster than A atoms, or 
Γ_B ≫ Γ_A. At a reasonably high concentration of B, it becomes very 
improbable that a given B atom does not have at least one nearest 
neighbour that is also B. The B atoms, therefore, form a continuous 
network throughout the lattice. If diffusion is by means of vacancies, 
most of the movement of vacancies will be along this network of B 
atoms, because the rate of jump of a B into a vacancy is considerably 
greater than that of an A atom. No redistribution of atoms, or ordering, 
can be produced by such movement of vacancies along the B network, 
since for every B atom that has jumped another simply takes its place. 
Only on those relatively rare occasions when an A atom makes a jump 
will a contribution toward re-arrangement of the atom distribution in the 
lattice occur. Thus Γ_A controls the rate of relaxation, and this quantity 
is the appropriate jump frequency to be inserted in equation (52). The 
interpretation of the internal friction effects in terms of the pair 
reorientation of solute atoms, discussed above, leads one to emphasize 
the importance of the movement of solute atoms, which may not 
actually be the rate controlling factor at all. 
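
The statement that a B atom is very unlikely to lack a B nearest neighbour at reasonably high concentrations can be illustrated by a simple count. The sketch below assumes a perfectly random f.c.c. solution with twelve nearest neighbours per site, each occupied by B independently with probability equal to the atomic fraction c; these assumptions and the function name are illustrative. It gives only the chance of at least one like neighbour and does not by itself prove that a continuous network exists.

```python
def prob_has_like_neighbour(c, z=12):
    """Chance that at least one of z nearest-neighbour sites holds a B atom,
    for a random solution of atomic fraction c (z = 12 assumed for f.c.c.)."""
    return 1.0 - (1.0 - c) ** z


# Atomic fractions chosen for illustration; 0.158 and 0.302 are the extreme
# zinc contents of the silver-zinc alloys of Fig. 7.
for c in (0.01, 0.10, 0.158, 0.302):
    print(f"c = {c:5.3f}: P(at least one B neighbour) = "
          f"{prob_has_like_neighbour(c):.3f}")
```

The same count shows why the substitutional effect needs a high solute content: in a very dilute solution most solute atoms have no solute neighbour at all.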

Fig. 7. Internal friction data for a series of silver-zinc alloys 
(frequency of vibration 0.8 cycle per second at room temperature). 
The atomic per cent zinc in the various alloys are 30.2, 24.2, 19.3, 
and 15.8, corresponding to the curves A to D respectively. 
[Figure not reproduced; axis label: temperature.] 

An illustration of internal friction data for Ag-Zn alloys is given in 
Fig. 7, which shows a series of internal friction peaks at one frequency 
for different compositions. The figure shows that both the height and 
location of the peaks are very sensitive to chemical composition. The 
magnitude of the maximum damping falls off rapidly as the concentration 
of zinc decreases, due to the smaller degree of ordering that can be 
produced at lower concentrations. The temperature at which internal 
friction is a maximum, or at which the time of relaxation τ = 1/ω, is 
higher the lower the zinc concentration. This result means 
that the controlling rate of atomic movement at any given temperature 
increases with increasing concentration of zinc. In the case of the lowest 
peak, an additional high temperature damping associated with grain 
boundaries becomes significant enough to distort the ordering peak. 
When data similar to those of Fig. 7 are obtained at different frequencies, 
the temperatures at which the peaks occur are correspondingly shifted. 
By carrying out experiments over a wide range of frequencies, the relaxa- 
tion times for the various silver-zinc alloys may be obtained over a wide 
range of temperatures, as shown in Fig. 8. Data at the lowest tempera- 
tures can only be obtained by the use of frequencies too low to be 
practicable, and are, therefore, most conveniently obtained by static 
methods instead of by vibration. The curves of Fig. 8 represent 
essentially the variation of atomic mobility with temperature. 
Unfortunately, no reliable conventional diffusion data are available for 
extension of these curves to high temperatures. The data obtained 
by internal friction and static relaxation methods cover a sufficiently 
large range, however, and are sufficiently precise, that a deviation from 
the usual straight line semi-logarithmic relation is quite apparent. A 
straight line plot for log Γ versus 1/T is only to be expected if H is a 
constant (equation (54)). There is no theoretical reason, however, why H 
may not be dependent on T. Conventional diffusion data have never 
been reliable enough to report such a deviation, except when channels 
for diffusion, other than volume diffusion, occur (e.g. diffusion down grain 
boundaries). The curves of Fig. 8 also show a strong concentration 
dependence of atomic mobility. An increase of about 10 per cent in zinc 
concentration means an increase in the controlling rate of atomic movement 
by a factor of ten. It is interesting to try to see if this strong 
concentration dependence resides primarily in the term Γ₀ or in H. 
Extrapolation of the curves of Fig. 8 to 1/T = 0 shows that the values 
of Γ₀ for the various concentrations are very nearly the same and, 
therefore, that the differences in atomic movement as a function of 
concentration are principally determined by differences in the heat of 
activation H. The intercepts Γ₀, and the corresponding values of D₀, 
may be compared with the strain theory of D₀ (discussed earlier in 
connexion with interstitial diffusion), which has also been applied to 
substitutional solutions. The agreement with theory is quite good. In 
the case of conventional diffusion data, the agreement between reported 
values and theoretical values has generally been poor, but this lack of 
agreement has been related to the presence of mechanisms other than 
volume diffusion in many of these experiments. 

Fig. 8. The variation of the time of relaxation for stress-induced 
ordering with temperature, for a series of silver-zinc alloys. 
[Figure not reproduced.] 
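
The route from peak temperatures to a heat of activation can be sketched as follows. The sketch assumes a constant H, so that τ = τ₀ exp(H/RT), and uses the peak condition τ = 1/ω with ω = 2πf; the values H_TRUE and TAU0_TRUE, the list of frequencies, and the function names are hypothetical and serve only to generate self-consistent synthetic data. The last function gives the difference in H that corresponds to a given ratio of rates when the pre-exponential factors are equal.

```python
import math

R_CAL = 1.987          # gas constant in cal/(mole K)
H_TRUE = 30000.0       # cal/mole, hypothetical value for the synthetic data
TAU0_TRUE = 1.0e-15    # sec, hypothetical value for the synthetic data


def relaxation_time_at_peak(freq_hz):
    """At the internal friction peak the relaxation time equals 1/omega."""
    return 1.0 / (2.0 * math.pi * freq_hz)


def peak_temperature(freq_hz):
    """Temperature (K) at which omega*tau = 1 for the synthetic alloy."""
    omega = 2.0 * math.pi * freq_hz
    return H_TRUE / (R_CAL * math.log(1.0 / (omega * TAU0_TRUE)))


def fit_arrhenius(temps_kelvin, taus):
    """Straight-line fit of ln(tau) against 1/T; returns (tau0, H)."""
    xs = [1.0 / t for t in temps_kelvin]
    ys = [math.log(tau) for tau in taus]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # equals H/R
    tau0 = math.exp(y_mean - slope * x_mean)
    return tau0, slope * R_CAL


def heat_difference_for_rate_ratio(ratio, temp_kelvin):
    """Difference in H giving the stated rate ratio at equal pre-factors."""
    return R_CAL * temp_kelvin * math.log(ratio)


freqs = [0.01, 0.1, 0.8, 10.0]             # cycles per second, hypothetical
temps = [peak_temperature(f) for f in freqs]
taus = [relaxation_time_at_peak(f) for f in freqs]
tau0_fit, h_fit = fit_arrhenius(temps, taus)
print(f"fitted H = {h_fit:.0f} cal/mole, tau0 = {tau0_fit:.2e} sec")
print(f"factor of ten at 400 K: dH = "
      f"{heat_difference_for_rate_ratio(10.0, 400.0):.0f} cal/mole")
```

A real set of curves such as those of Fig. 8 would show curvature if H varies with temperature, in which case a single straight-line fit of this kind is only a local approximation.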

In addition to the above experiments, the stress-induced ordering 
effect has also been used for experiments fundamental to our under- 
standing of the mechanism of diffusion. In terms of the vacancy 
mechanism of diffusion, which has been widely accepted, the rate of 
diffusion or atom movement depends on the product of the concentra- 
tion of vacancies and the rate of jump of atoms into vacancies. Both 
of these factors are strongly temperature dependent, but conventional 
diffusion measurements are incapable of separating the two. On the 
other hand, by means of low temperature relaxation measurements it 
has been possible to detect the presence of a nonequilibrium concentra- 
tion of vacancies in a silver-zinc alloy that had been quenched rapidly 
from a high temperature. The effect of such quenching is to trap some 
of the relatively large number of vacancies present at high temperatures, 
which then produce abnormally rapid atomic movement, and, therefore, 
abnormally short relaxation times, at low temperatures. For example, 
the atomic jump rate, at 50°C is increased by a factor of 10° over the 
equilibrium rate in a Ag-Zn wire that had been quenched from 400°C 
into water at room temperature. Such experiments not only give strong 
additional evidence for a vacancy mechanism, but also make it possible 
to study separately the two factors which determine the rate of atomic 
movement. Furthermore, from observations on the rate at which the 
"quenched-in" vacancies disappear, it is hoped that the mechanism by 
which lattice vacancies are created and absorbed will be determined. 
It has been postulated, for example, that dislocations act as sources 
and sinks for vacancies. 
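
The separation of the two factors can be made concrete with a rough estimate. The sketch below assumes that the equilibrium vacancy fraction varies as exp(−H_f/RT), that every vacancy present at the quenching temperature is retained, and that the jump rate of an atom into a vacancy is unaffected by the quench, so that the enhancement of atomic movement equals the ratio of vacancy concentrations. The heat of formation H_FORM_ASSUMED is a purely hypothetical number, not a value from the cited work.

```python
import math

R_CAL = 1.987              # gas constant in cal/(mole K)
H_FORM_ASSUMED = 20000.0   # cal/mole, hypothetical heat of vacancy formation


def equilibrium_vacancy_fraction(temp_celsius, h_form):
    """Relative equilibrium vacancy concentration, exp(-H_f/RT)."""
    return math.exp(-h_form / (R_CAL * (temp_celsius + 273.15)))


def quench_enhancement(t_quench_c, t_measure_c, h_form):
    """Ratio of retained to equilibrium vacancy concentration, assuming
    every vacancy present at the quench temperature is trapped."""
    return (equilibrium_vacancy_fraction(t_quench_c, h_form)
            / equilibrium_vacancy_fraction(t_measure_c, h_form))


def formation_heat_from_enhancement(enhancement, t_quench_c, t_measure_c):
    """Invert the relation above: H_f implied by a measured enhancement."""
    inv_diff = (1.0 / (t_measure_c + 273.15)) - (1.0 / (t_quench_c + 273.15))
    return R_CAL * math.log(enhancement) / inv_diff


factor = quench_enhancement(400.0, 50.0, H_FORM_ASSUMED)
print(f"enhancement for quench 400 C -> 50 C: {factor:.2e}")
```

Read in the other direction, a measured enhancement of the jump rate gives an estimate of the heat of vacancy formation, which conventional diffusion measurements alone cannot supply.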


Examples of Initially Ordered Substitutional Solutions 


If a solid solution shows an incomplete state of long range order under 
zero stress, then, in accordance with the theory presented in the 
beginning of this section, a change in the state of order will be produced 
by an applied stress if the ordering process is accompanied by a change 
of lattice dimensions. This phenomenon of stress-induced change in the 
degree of order, first predicted by GORSKY, leads to mechanical 
behaviour corresponding to that of the standard linear solid. Unlike 
the two cases already considered, for which the solid solution is dis- 
ordered under zero stress, the characteristic internal friction peak in the 
present case is observable only as a function of frequency (at constant 
temperature), and not as a function of temperature, at constant fre- 
quency. The reason for this difference is that, for the initially ordered 
solution, the degree of order at zero stress is itself a function of tempera- 
ture. Thus, experiments which simply determine internal friction as a 
function of temperature are difficult to interpret. Such experiments, 
carried out by KÖSTER and collaborators on several different 
ordering alloys, show that internal friction increases to very high values 
in the vicinity of the Curie point for long-range order, and continues to 
rise, as the temperature is increased into the disordered region, until 
values are obtained which are outside the range that the apparatus can 
measure (about φ = 0.1). In addition to the fact that the degree of 
order at zero stress depends upon temperature, it has also been shown 
that in the immediate proximity of the Curie point Δ_M becomes very 
large, since in this range the ordering energy is a very rapidly varying 
function of the degree of order. It is, therefore, difficult to draw quanti- 
tative conclusions from the type of measurements obtained by KÖSTER, 
unless supplemented by other experiments. 

In the case of CuAu, whose ordered structure is tetragonal (c/a = 1-1), 
striking relaxation effects are to be expected, due to stress-induced 
changes in the degree of long-range order. The existence of such effects 
was predicted and confirmed by GORSKY, who carried out elastic 
after-effect measurements at constant temperature. (These experiments 
are analogous to measurements of internal friction as a function of 
frequency of vibration.) 


INTERNAL FRICTION ORIGINATING IN INTERFACES 
Stress Relaxation Across Incoherent Interfaces 


Internal interfaces in a metal provide important sources of internal 
friction. A widely studied source of damping is the relaxation of shear 
stress across incoherent interfaces, such as large angle grain boundaries 
and interfaces between different phases. The grain boundary case has 
already been reviewed in this series; the discussion here will, there- 
fore, be brief and will mainly concern itself with the question of the 
distribution of interfaces. This point of view was recently reviewed by 
the writer. 

The concept of stress relaxation across an interface is readily des- 
cribed if we imagine a single interface of limited area located in a metal, 
a hypothetical case, to be sure. The application of a shear stress to such 
a specimen produces a state of uniform stress throughout the material, 
but this state will not remain indefinitely. Because of the inability of an 
incoherent interface to sustain a shear stress across it, stress relaxation 
will take place by a process of slipping across the interface until the 
stress there has gone to zero. Shear stress, therefore, becomes redistri- 
buted so that it is concentrated along the edges of the interface. This 
relaxed state is indicated schematically in Fig. 9, where lines of force are 
used to represent the stress field, such that the number of lines cutting 
across a unit area is proportional to the shear stress. Because of the 
increased stress at the ends, the over-all shear strain in the material is 
increased as a result of this relaxation process. When stress is released, 
the existence of a reverse stress across the interface results in the 
material being restored to its original state. 

Such a material, therefore, behaves like a standard linear solid, where 
the parameter p, which is now the difference between the mean stress 
and that across the interface, approaches its equilibrium value according 
to a relaxation equation (equation (35)). It is readily shown that the 
time of relaxation τ is proportional to the linear dimension of the inter- 
face, but the temperature dependence of τ (i.e. the activation energy H) 
is independent of the size of the interface if the unit mechanism of the 
process is everywhere the same. Fig. 9 shows that, in the relaxed 
condition, the shear stress undergoes a significant decrease in an 
approximately spherical region whose cross-section is outlined by the 
dashed circle. This region is called the sphere of relaxation. 

Fig. 9. Schematic representation of an incoherent 
interface across which relaxation of shear stress has 
occurred. The shear stress at any point is the number 
of lines per unit area. The cross section of the sphere 
of relaxation is represented by the dashed circle. 

In a well-annealed polycrystalline metal there is a regular collection 
of interfaces similar to the one just described, in which relaxation, or 
so-called "viscous slip," is held up at the edges and corners where the 
various boundaries meet. The spheres of relaxation will not overlap 
significantly, which means that each interface responds to stress 
independently of the others. The mechanical behaviour of the material 
is, therefore, a simple superposition of the contributions due to the 
individual interfaces. Since the linear dimensions of all interfaces are 
not the same, the internal friction peak is usually considerably broader 
(by a factor of 2 or 3) than the peak which corresponds to a single time 
of relaxation. This is the well-known “grain boundary peak” that has 
been studied by Ke*. 1° and reviewed in the article by Kine and 
CuaLMeErRS.’” The shift of this peak with frequency gives the heat of 
activation H for the unit mechanism responsible for the relaxation. 
A very large AM effect accompanies the grain boundary relaxation; the 
apparent modulus at low frequencies may be 30 per cent lower than the 
equivalent value for a single crystal. 
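
The broadening produced by a spread of interface sizes can be illustrated with a simple superposition. The sketch below assumes that each interface contributes a standard-linear-solid term proportional to ωτ/(1 + ω²τ²), that the contributions add independently as stated above, and that the relaxation times (proportional to the linear dimensions) are spread uniformly in log τ over a factor of 100 with equal weights. That particular spread, the grid, and the function names are hypothetical choices made only for illustration.

```python
import math


def debye(omega_tau):
    """Frequency dependence of a single relaxation, omega*tau/(1+(omega*tau)^2)."""
    return omega_tau / (1.0 + omega_tau ** 2)


def friction_curve(log_omegas, taus, weights):
    """Weighted sum of single-relaxation terms over a set of relaxation times."""
    return [sum(w * debye((10.0 ** lw) * tau) for tau, w in zip(taus, weights))
            for lw in log_omegas]


def half_width_decades(log_omegas, values):
    """Full width at half maximum, measured in decades of frequency."""
    peak = max(values)
    above = [lw for lw, v in zip(log_omegas, values) if v >= 0.5 * peak]
    return above[-1] - above[0]


# Frequency grid from 10^-4 to 10^4 in steps of 0.001 decade.
log_omegas = [i * 0.001 - 4.0 for i in range(8001)]

single = friction_curve(log_omegas, [1.0], [1.0])

# Hypothetical distribution: 21 relaxation times spanning a factor of 100.
taus = [10.0 ** (k / 10.0) for k in range(-10, 11)]
weights = [1.0 / len(taus)] * len(taus)
spread = friction_curve(log_omegas, taus, weights)

width_single = half_width_decades(log_omegas, single)
width_spread = half_width_decades(log_omegas, spread)
print(f"single relaxation: {width_single:.2f} decades")
print(f"distributed relaxation: {width_spread:.2f} decades")
```

The single-relaxation width is fixed by the form of the peak, so any measured excess width is a direct indication of a spread in relaxation times.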

On the high temperature side of the grain boundary peak, the 
internal friction falls at first, but then begins to rise again. This effect 
has been attributed to slipping at the interface corners where the 
stress is concentrated after relaxation; the same phenomenon results 
in a permanent set when a static stress is applied. These effects were 
previously interpreted differently by KÊ. 

Internal interfaces which are not allowed to grow at high tempera- 
tures to form a regular array, like the grain boundaries of a well 
annealed polycrystalline metal, may show more complicated effects. 
If the array of interfaces is irregular, or somewhat haphazard, so that 
large overlapping of the spheres of relaxation occurs, the behaviour of the 
collection of interfaces can no longer be regarded as a simple super- 
position of the behaviour of the individual interfaces, but instead we 
obtain an example of coupled relaxations (page 21). In this case, when 
one interface relaxes it produces a significant increase in the shear 
stress across a second, and vice versa, so that very large non-elastic 
strains may be produced. Such a description was used to explain the 
large internal friction obtained after discontinuous precipitation in an 
aluminium-zinc alloy.** Instead of the usual broadened peak, the 
internal friction seems to climb to exceedingly high values at high 
temperatures, or, if temperature is held constant, at low frequencies. 
Such behaviour could still be described, phenomenologically, in terms of 
a superposition of relaxation times, but the range of times required 
would be very large and the values would have little physical signifi- 
cance. The fact that highly cold-worked materials show this type of 
behaviour, and the interpretation of this fact, will be discussed in the 
next section. 

Although the general descriptive features of stress relaxation across 
incoherent interfaces may be given without knowledge of the atomic 
mechanism responsible for this behaviour, the explanation of the 
kinetics of the process must be closely tied in with the atomic mechanism. 
Thus, much attention has been given to values for the heat of activation 
for grain boundary stress relaxation. On the basis of very little evidence 
available at the time, KÊ'™ suggested that the heat of activation for 
grain boundary relaxation may be the same as that for volume diffusion. 
This correlation is a surprising one, inasmuch as there seems to be no 
theoretical reason for it. Subsequent experimental evidence**: ', 1% 
has shown values of the activation energy for grain boundary relaxation 
both greater and smaller than that for volume diffusion. Apparently, 
the suspected correlation is not correct, but the mechanism of stress 
relaxation is still not understood. A mechanism suggested by MOTT,’ 
and discussed at length in the earlier article,’ is disputed on the basis 
of evidence presented by the writer.'®' The effect of minute amounts of 
impurities on the activation energy for stress relaxation is not clear, 
as yet, although it is known that impurities may have a large effect on 
both the position and height of the grain boundary internal friction 
peak.*4, 1° Further investigations of the heat of activation for stress 
relaxation across grain boundaries will most certainly contribute to our 
understanding of the nature of the grain boundary. 


Movement of Twin Interfaces 

When transformation from a cubic to a tetragonal structure occurs, 
heavy twinning is necessary in order to avoid high residual stresses in 
the material. Such twinned structures have been observed in a Mn-Cu 
(12 per cent Cu) alloy’ as well as in In-Tl alloys. An internal friction 
peak with maximum at about 0°C was observed by WORRELL for the 
Mn-Cu alloy. Inasmuch as crystallographic coherency exists across a 
twin interface, the explanation for the observed internal friction cannot 
be the same as for the viscous-like slipping of incoherent interfaces. 
The tetragonality of the lattice structure suggests that the application 
of shear stress will induce movement of a twin interface normal to 
itself, in such a way as to enlarge the region where the direction of 
tetragonality is favourable with respect to the applied stress relative to 
that in which it is unfavourable. The fact that an internal friction peak 
is observed leads to the belief® that this movement of twin boundaries 
is a relaxation phenomenon. Further measurements have been made” 
for the Mn-Cu alloy, but internal friction has not yet been studied as a 
function of frequency. There is, therefore, still some uncertainty as to 
whether the internal friction actually originates in a relaxation process; 
if it does, the significance of the time of relaxation must be ascertained. 


INTERNAL FRICTION RELATED TO PLASTIC PHENOMENA 


The type of internal friction to be discussed here is related to the 
internal stresses, or dislocations, introduced by cold working and is 
commonly referred to as “cold work internal friction.” The latter 
designation is not accurate inasmuch as effects are also observed due 
to the residual dislocations in well annealed materials. 

In contrast to our relatively recent awareness of the other sources of 
internal friction discussed in this article, the study of damping of 
plastic origin has a long history. As pointed out by GUILLET,™ even 
the earliest bell makers were cognizant of the fact that brittle materials 
gave the lowest damping. In recent years, considerable effort has been 
directed toward the study of this cold work internal friction. Yet, in 
spite of the effort expended, our understanding of the nature of this 
type of damping is in a very elementary stage. The reasons for such 
slow progress are many. The effects are very complex and a consider- 
able number of variables are involved, among them, the degree of cold 
work and the type of cold work given the specimen material, the time 
at room temperature before measurement, the frequency of vibration, 
and the impurity content of the specimen. Much of the work in the field 
is not fundamental in character but is concerned only with the measure- 
ment of internal friction, at one temperature and frequency, as a 
function of some alloying element or a particular heat treatment. Often 
no attempt is made to see if other sources of internal friction may also be 
present to complicate the results. As a consequence, it is usually difficult 
to compare data by different investigators, which often contradict 
each other. 

On the brighter side, there have been a sufficient number of funda- 
mental investigations to give us a collection of important facts. These 
basic facts will first be reviewed without any attempt to interpret them. 
Finally, a discussion in terms of dislocation theory will attempt to see 
to what extent the factual material can be explained. 


Effect of Cold Working 

The room temperature internal friction of a well annealed material 
usually has a very low value, of the order 10-°, although this value is 
apparently very sensitive to the purity of the material. The structure 
sensitivity of this damping, especially of single crystals, is most striking; 
various investigators’: 4?. "3 report that slight jarring or almost any 
handling operation will produce a large increase in damping. In addi- 
tion, two single crystals prepared of the same material and in exactly 
the same way differ in their internal friction values, sometimes by as 
much as a factor of ten.“*." Similarly, different parts of one crystal 
do not show the same internal friction.” ™* 

The application of a small static stress to a well annealed material 
will usually raise its internal friction appreciably, even though no 
observable yielding has occurred. The duration of loading appears to be 
important, since it seems possible™*® to produce a larger damping by 
dropping or jarring a specimen, than by equivalent static loads. This 
point bears further investigation, however. Systematic investigation 
of the effect of cold working on damping has been carried out using 
various systems of stressing: application of compressive loads, pulling 
in a tensile machine, drawing through dies. One cannot be certain that 
the same types of internal stresses are always produced by these various 
methods and, therefore, that the data obtained are always comparable. 
Several authors show that the room temperature internal friction as a 
function of cold work goes through a maximum. ZENER, CLARKE and 
SMITH™ report such a maximum in polycrystalline brass at about 5 per 
cent elongation, for measurements at a frequency of 2000 c.p.s. The 
observed damping, even at the maximum, is very small, but this may 
be related, as we shall see, to the fact that about two weeks elapsed 
between the cold working and the measurements. LAWSON™® reports 
very striking maxima for spectroscopically pure polycrystalline copper, 
measured at 50 kc/sec. Correspondingly, a minimum is observed in the 
dynamic Young’s modulus of the specimens (ΔE/E about 6 or 7 per 
cent) at the same amount of cold work that produces the maximum 
damping. The height of the maximum in the room temperature damp- 
ing is greater the lower the temperature at which the deformation is 
carried out. HASIGUTI and HIRAI™ obtain similar maxima for copper 
single crystals, measured at frequencies in the tens of kilocycles. The 
maxima are not identical for different specimens, but occur for 1-10 per 
cent elongation and for values ¢ x 10° in the range 3-10. For large 
amounts of cold work the internal friction apparently decreases to a 
constant value greater than the value for the annealed specimen. 
BOULANGER"™’ also observes a maximum in the residual damping (after 
room temperature annealing) of a nickel-chrome alloy. Other investi- 
gators report no maximum, but a monotonic increase of damping with 
cold working of aluminium™®*: ¥° and iron,” although the rate of rise of 
the curves is considerably decreased for large amounts of cold work. 
In cases where simultaneous observations of the dynamic modulus are 
made, a decrease in the modulus is generally observed to parallel an 
increase in the internal friction. This decrease in modulus may be as 
large as several per cent.” As we shall see, only part of this decrease 
may be interpreted as a ΔM effect. 

Although the behaviour of pure and impure materials as a function of 
degree of cold work seems to be about the same, qualitatively, the 
magnitude of the changes produced by cold working is greater for the 
purer materials."*, 4% 


Effect of Annealing 


The conventional description of the effect of annealing a cold worked 
metal at successively increasing temperatures is that no significant 
internal changes occur until recrystallization begins, except for the 
relief of “macrostresses” responsible for stress-corrosion cracking.'** 
This description is based on the fact that resistivity, x-ray line broaden- 
ing, microstructure, hardness, and tensile strength all show essentially 
no change as a result of annealing below the recrystallization tempera- 
ture. (Hardness and tensile strength may even increase slightly for 
low temperature anneals.) It has, therefore, been supposed that the 
internal stresses produced by plastic flow are not relieved before 
recrystallization occurs. Internal friction measurements show sur- 
prisingly different phenomena. The room temperature damping (usually 
at high frequencies) produced by cold working may recover completely 
to the value it had before cold working, after anneals at temperatures too 
low to produce any changes in the properties mentioned above, as first 
shown by FÖRSTER and KÖSTER® for brass and low-carbon steel. The 
prior recovery of internal friction has no influence on the tendency of the 
material to recrystallize at sufficiently high temperatures and, con- 
versely, the room temperature damping is little affected by the onset of 
recrystallization, if this damping has already recovered during a low 
temperature anneal. More extensive investigations of the annealing of 
cold worked α-brass are reported by KÖSTER and ROSENTHAL.™ It is 
shown that sizeable recovery of internal friction occurs at room tem- 
perature in tens of hours. Measurements of the damping as a function 
of time show that the rate of recovery is greatest at the shortest times. 
Since measurements in these experiments were started one-half hour 
after cold working, initial values of the damping may have been con- 
siderably higher than any of the reported values. Very rapid measure- 
ments that may be taken immediately after plastic deformation are 
therefore desirable in annealing studies. The recovery of internal 
friction at room temperature occurs in about the same time for specimens 
given prior extensions of from 10 to 60 per cent, except that the starting 
values are higher for the greater elongation. The work of KÖSTER and 
ROSENTHAL indicates that the recovery of room-temperature internal 
friction in cold worked brass takes place in a similar manner at various 
annealing temperatures, the time of recovery decreasing from tens of 
hours at room temperature to minutes or seconds at 200°C. Similar 
measurements on cold worked brass are reported by FUSFELD,' who 
shows that the effective heat of activation for the recovery process is 
substantially lower than 10,000 calories per mole. Observations of the 
effect of annealing on internal friction in aluminium" and iron’ are 
reported by KÖSTER. The results for aluminium seem to stand alone in 
that they show incomplete recovery below the recrystallization tempera- 
ture. The case of α-iron is very striking; here the recrystallization 
temperature is at 500-600°C, yet damping recovers completely at room 
temperature in ten hours, and at 100°C in minutes. Similar effects are 
also observed in copper... * 
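
The order of magnitude of these recovery times can be checked against a simple thermally activated rate law. The sketch below assumes that the recovery time varies as exp(H/RT) and takes H = 10,000 cal/mole, the upper bound quoted above for brass, together with a hypothetical recovery time of ten hours at 20°C. Both the rate law and the numbers are illustrative assumptions rather than results from the papers cited.

```python
import math

R_CAL = 1.987  # gas constant, cal/(mol K)

def recovery_time(t_ref, T_ref, T, H):
    """Scale a recovery time t_ref at T_ref (K) to temperature T (K).

    Assumes the time is proportional to exp(H / (R T)), with H in cal/mole.
    """
    return t_ref * math.exp((H / R_CAL) * (1.0 / T - 1.0 / T_ref))

t_room = 10 * 3600.0  # hypothetical: ten hours at room temperature, in seconds
t_200 = recovery_time(t_room, 293.0, 473.0, H=10_000.0)
print(f"estimated recovery time at 200 C: about {t_200:.0f} s")
```

With these assumed inputs the estimate comes out at the order of a minute, which is at least compatible with the "minutes or seconds" quoted above; a smaller value of H would give a weaker temperature dependence.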

The ability of cold worked materials to anneal at room temperature, 
as far as internal friction measurements are concerned, may, in part, 
explain some of the discrepancies in the effect of cold work on internal 
friction as reported by various investigators. The results of BOU- 
LANGER’ show, for example, that whether or not a maximum is 
obtained in the curve of damping versus degree of cold work depends 
on the time that has elapsed between the cold working and the measure- 
ments. 

The distinct difference between the effect of annealing on internal 
friction and on most other properties leads to the almost unavoidable 
conclusion that there are at least two distinct types of internal stress: 
one responsible for hardness and gross plastic deformation, which is 
required in order that recrystallization may occur, and the second 
responsible for internal friction effects. 

The dynamic modulus is apparently somewhere between hardness 
and internal friction in its reaction toward annealing. A small part of 
the large decrease in modulus, produced by large amounts of cold work, 
recovers together with the internal friction; the remainder is not 
affected by anneals below the recrystallization temperature. This 
residual effect is clearly observed,’ either after annealing for a long 
time at room temperature, or after successive anneals at increasing 
temperatures have completely eliminated the cold work internal fric- 
tion. The fact that the recoverable part of the decrease in dynamic 
modulus is eliminated in the same time as the internal friction, is strong 
evidence that it represents a ΔM effect, i.e. it results from an additional 
in-phase component of strain produced by the same non-elastic 
mechanism as that responsible for the damping. The part of the 
decrease in dynamic modulus which is not recoverable below the 
recrystallization temperature is apparently a lowering of the modulus 
M’ (equation (9)) that relates the stress to the elastic strain. That the 
presence of internal strains will produce such a lowering of the elastic 
modulus, was demonstrated by ZENER. One should also keep in 
mind the fact that large amounts of cold work may also change 
the modulus by producing preferential orientation (texture) in a 
polycrystalline material. 


Strongly Amplitude Dependent Damping 

Although amplitude dependence of internal friction is observed**. 1%6 
for vibration at large strain amplitudes, it had been believed’ that, so 
long as the maximum strain amplitude in a specimen is kept below 
10~* or 10-*, damping is independent of amplitude. That this belief is 
not always correct is strikingly demonstrated by the experiments of 
READ,*: ?° which show that the internal friction of annealed and slightly 
cold worked single crystals of zinc and copper is strongly amplitude 
dependent, even at strain amplitudes of 10⁻⁷; for the case of zinc the 
damping may increase by almost one hundred times from low ampli- 
tudes up to 10-*. These experiments, which use the piezoelectric 
method, also show that, associated with the amplitude dependent 
damping, there is an amplitude dependent ΔM effect, as measured by 
the decrease in resonant frequency with increasing amplitude of 
vibration. The resonant frequency in these experiments is in the tens of 
kilocycles. It is shown that the curve of damping versus strain ampli- 
tude for zinc crystals is not reversible, i.e. different values are obtained 
for decreasing amplitudes than those for increasing amplitudes. These 
results are interpreted as evidence that the measurements themselves 
produce internal changes in the crystals. Further experiments on these 
effects in zinc crystals are reported by several investigators.’*’, 4. #® 


The experiments on copper crystals show considerably simpler effects, 
probably because room temperature represents a much lower “reduced 
temperature” for copper than for zinc. (Copper has a much higher 
melting temperature and recrystallization temperature than zinc.) The 
damping-amplitude curves for copper crystals are not affected by the 
measurements themselves and, therefore, represent a measure of the 
mechanical state of the material. Further experiments on copper 
crystals are those of MARX and KOEHLER™ and of the writer." »"* 

Fig. 10. Effect of successive applications of static compressive stress 
on the amplitude-dependent internal friction in a single crystal of copper 

Fig. 10, taken from reference 12, shows the effect of successive 
applications of a static compressive stress on the curves of internal 
friction versus strain amplitude, for a copper crystal. The stresses 
were sufficiently small so that no observable change of length of the 
crystal occurred as a result of stressing. Similar data were also given 
by READ.” It should be noted that these curves start in a parabolic 
fashion, so that the damping below a strain amplitude of 10⁻⁷ is 
practically independent of amplitude. One could then ignore the 
amplitude dependent part of the damping and concentrate only on the 
internal friction at very low amplitudes. On the other hand, the strong 
amplitude dependence seems to be the most characteristic feature of 
this type of damping. Furthermore, it is possible to measure 
very precisely the changes in resonant frequency with strain amplitude, 
so that the amplitude dependence of both the ΔM effect and the damp- 
ing may be obtained simultaneously. (Changes in resonant frequency 
with amplitude can be measured to better than 1 part in 50,000.) For 
these reasons, the work of the writer concentrates on a quantitative study 
of the amplitude dependent internal friction and ΔM effect. The experi- 
mental results show that the ratio of the amplitude dependent part of 
damping to that of the ΔM effect is independent of frequency and of the 
temperature; this fact could not be explained in terms of any mechan- 
ism of viscoelastic behaviour. On the other hand, the experimental 
values of this ratio are in quantitative agreement with the predictions 
of the theory of static hysteresis as given by equations (49). The 
exceptional amplitude dependence of this type of internal friction might, 
in itself, have led us immediately to suspect a mechanism of static 
hysteresis. This conclusion is suggested by the discussion (in the earlier 
section on static hysteresis) which shows that such a mechanism 
inherently leads to strong amplitude dependence because of the non- 
linearity of the static hysteresis loops, in contrast to viscoelasticity, 
which is essentially a linear phenomenon at low amplitudes. 
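
To give a sense of what the quoted frequency resolution means for the modulus, one may assume, as a sketch, that the resonant frequency f of the specimen varies as the square root of the dynamic modulus M, with dimensions and density held fixed. This proportionality is an assumption introduced here for illustration; it is not taken from the equations of this article.

$$ f \propto \sqrt{M} \;\Rightarrow\; \frac{\Delta M}{M} \approx 2\,\frac{\Delta f}{f}, \qquad \frac{\Delta f}{f} = \frac{1}{50{,}000} \;\Rightarrow\; \frac{\Delta M}{M} \approx 4 \times 10^{-5} $$

On this assumption, fractional changes in modulus of a few parts in 10⁵ would be detectable.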

The reader may wonder why the phenomenological mechanism might 
not have been decided with certainty by a simple study of the frequency 
dependence of internal friction. Such measurements do not provide 
decisive results in the present case, because of the extreme structure 
sensitivity of the property we are measuring. As long as one observes 
that two crystals, grown and treated identically, or that the two halves 
of one crystal," differ considerably in values of internal friction, it 
becomes difficult to measure any frequency effects that are not masked 
by such variations. Even measurements at a series of natural fre- 
quencies (overtones) of one given specimen do not represent the 
properties of the same part of the lattice, since the maximum stress 
amplitude occurs in different parts for different harmonics. Observed 
variations with frequency are, therefore, different for different speci- 
mens, but the scatter seems to fall about a frequency independence in 
the case of the copper single crystals.“ For more highly cold worked 
polycrystalline materials, greater homogeneity seems to exist, and 
internal friction is reported®* to be independent of frequency. 

The temperature dependence of the curves of internal friction versus 
amplitude has also been studied for single crystals of copper in a limited 
temperature range." The internal friction and ΔM effect increase with 
temperature in this range of measurement, for which a tentative 
explanation has been advanced.!* Other observers report?®, 1%° 
different results for the temperature dependence. Further experimenta- 
tion along these lines will undoubtedly be fruitful. 

Surface conditions seem to have an effect on internal friction, but it is 
reported?*’, 4," that etching the surface results in a lowering of the 
internal friction curve as a whole without affecting the amplitude 
dependent part. 

The demonstration that amplitude dependent damping in single 
crystals at high frequencies originates in a mechanism of static hysteresis, 
and the close relation between this mechanism and strong amplitude 
dependence, means that it is always desirable to know whether damping 
is or is not strongly amplitude dependent. Unfortunately, many of the 
publications give no clue regarding this matter, either because the 
apparatus used involved weak electromechanical coupling, so that all 
strain amplitudes are below 10⁻⁷ where no amplitude dependence is 
observed, or because the experimenter did not check on the possibility 
of amplitude dependence. 


Study of a Heavily Cold Worked Material 

One investigation by KÊ"* and ZENER™ deserves special attention, 
since it can be interpreted in terms of a mechanism which differs greatly 
from static hysteresis. The experiments differ in two ways from the 
single crystal investigations. First, the measurements are at low 
frequencies, i.e. obtained by means of a torsion pendulum. Second, the 
specimens (pure and commercial aluminium) are severely cold worked, 
by drawing them through dies to a 95 per cent reduction in area. The 
internal friction of these heavily cold worked specimens is independent 
of amplitude up to strain amplitudes of 10-°. It originates in anelastic 
behaviour, since creep produced by static stress is completely recover- 
able; furthermore, it varies with frequency and temperature as a 
function of the product $\omega e^{H/RT}$, with H equal to 31,000 cal/mole. The 
curve of internal friction versus frequency does not show a maximum, 
but climbs to very high values at low frequencies. This behaviour is 
completely comparable to that to be expected in the case of coupled 
relaxations of an irregular network of interfaces, discussed in the 
previous section. In the present case the interfaces are slip bands, which 
acquire a viscous behaviour due to the interlocking of the dislocations 
present in high concentrations in these bands." The irregular distribu- 
tion of the bands results in a coupling phenomenon, as discussed earlier. 
The interpretation in terms of a network of slip bands is in agreement 
with the observations of WOOD'* that a heavily cold worked material 
is fragmented into crystallites. We may regard the slip bands as the 
crystallite boundaries. 


Discussion in Terms of Dislocation Theory 


Most of the important facts concerning the internal friction of plastic 
origin have been discussed, and the reader, reflecting upon them, will 
undoubtedly agree that many careful experiments remain to be per- 
formed before a detailed understanding of the atomic mechanism of this 
damping will be possible. An attempt at interpretation of the measure- 
ments in terms of dislocation theory must, in the present stage, be 
somewhat speculative. On the other hand, new experiments are 
generally the outgrowth of speculation. 

In two investigations progress has been made toward an under- 
standing of the mechanism of damping, at least from a phenomenological 
viewpoint. These are the investigations of amplitude dependent 
damping in single crystals of copper at high frequencies, and the 
study of heavily cold worked aluminium at low frequencies. Let us 
start with the latter. Here a satisfactory explanation may be given in 
terms of slip bands or crystallite interfaces. The behaviour of regions 
that are very dense in dislocations is not conveniently discussed in 
terms of dislocation theory, for the same reason that it is not profitable 
to discuss large angle grain boundaries as made up of dislocations: the 
identity of the individual dislocations is lost. The study of heavily cold 
worked material seems to fit best into the category of damping resulting 
from stress relaxation across incoherent interfaces. The fact that the 
observed heat of activation for the damping in heavily cold worked 
aluminium is the same as that for grain boundary relaxation in that 
material, within experimental error, is further evidence that the two 
effects are basically the same. The grain boundaries, of course, repre- 
sent a more regular collection of interfaces than the slip bands of a 
heavily cold worked metal, and, therefore, produce an internal friction 
peak. Annealing at low temperatures of the heavily cold worked material 
may result in lowered damping due to the production of greater regu- 
larity in the crystallite boundaries without appreciable growth of 
crystallites (which would lead to recrystallization). 

Now let us turn to the behaviour of slightly cold worked materials, 
where the properties of individual dislocations may be more readily 
discussed. As already pointed out, we must conclude, from the easy 
room temperature recovery of cold work internal friction, that the type 
of internal stress responsible for internal friction must differ from that 
which produces recrystallization and hardness, in that it is less stable 
or more mobile. Translated into dislocation language we may say that, 
whereas dislocations tightly bound in internal interfaces are responsible 
for gross plastic flow, the internal friction in a material that is not too 
heavily cold worked is associated with relatively free dislocations. We 
may imagine that during low temperature annealing these free disloca- 
tions are not lost from the lattice but simply become bound, either to 
impurity atoms or to other dislocations (presumably in internal surfaces). 
The ease with which recovery takes place, as indicated by its occurrence 
at room temperature and by the fact that the heat of activation for the 
process is small!** relative to corresponding values for volume diffusion, 
shows that the recovery process does not involve atomic diffusion. If 
recovery involves the locking of dislocations by impurity atoms, it is 
the dislocations that move to the impurity atoms, rather than vice 
versa. The occurrence of a mechanism of static hysteresis in high 
frequency measurements is now readily interpreted. The static hystere- 
sis loop is a consequence of a certain number of relatively free disloca- 
tions being torn loose from the small barriers that hinder their motion, 
for each increment of stress dσ. The stress-strain loop is then really a 
step-like affair, each step corresponding to the burst of a single disloca- 
tion, which moves almost instantaneously from the original barrier to 
another position where its motion is again interrupted. Each such dis- 
location burst produces non-elastic strain. Once a dislocation is pulled 
loose it will not generally return to its original position unless a reverse 
stress is applied. The magnitude of an individual step that constitutes 
the stress-strain loop is a small fraction of an atomic distance, too 
small to be observed as a step. The stress-strain loop that has been 
described is analogous to the magnetic hysteresis loop, in which sudden 
movements of magnetic domain boundaries are induced by an applied 
magnetic field in the same way as dislocation bursts are induced by an 
applied stress. 

A qualitative estimate of the dependence of internal friction on the 
degree of cold work may be given in terms of the present interpretation. 
The damping is proportional to the energy loss per cycle ΔW (equation 
(27)), and the contribution to this energy loss, in that part of the cycle 
where the stress is increased by an amount dσ, may be represented as 
dΔW. This latter quantity is proportional to the increase in non-elastic 
strain corresponding to the stress increment, which in turn is propor- 
tional to the product of the number of dislocations moved dN and the 
mean distance moved L. In the form of an equation, 


$$ d\,\Delta W \propto L \, dN $$


It now seems reasonable, that in successive stages of cold working 
starting from a well-annealed material, the major increase is, at first, 
in the term dN, while L remains essentially constant; more free disloca- 
tions are made available, but the distance that each may move in one 
burst is fixed by imperfections already present in the lattice. As cold 
working progresses, the crowding of dislocations soon begins to place a 
serious limit on L, which begins to decrease rapidly when the density 
of dislocations is large enough so that they begin to hinder each other's 
motion. The quantity dN cannot become indefinitely large to counter- 
act this decrease in L, since the tendency for relatively free dislocations 
to become “bound dislocations” will limit the number of free disloca- 
tions available. This argument predicts a maximum in the product 
L·dN, and, therefore, in the internal friction, as a function of the 
degree of cold work. The concept of dislocation bursts has, therefore, 
provided a reasonable explanation for the observed maxima in the 
curves of damping versus cold work. 
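
The qualitative argument above can be illustrated with a toy calculation. In the sketch below the functional forms for the number of free dislocations and for their mean distance of travel are hypothetical, chosen only to mimic the trends described in the text (a number that rises and then saturates, and a distance that shrinks once dislocations crowd one another). The parameter names and values are likewise assumptions and are not fitted to any data.

```python
def free_dislocations(strain, n_max=1.0, strain_sat=0.05):
    """Hypothetical dN: rises with cold work, saturates as dislocations become bound."""
    return n_max * strain / (strain + strain_sat)

def mean_path(strain, path0=1.0, strain_crowd=0.05):
    """Hypothetical L: constant at first, falling once dislocations hinder each other."""
    return path0 / (1.0 + (strain / strain_crowd) ** 2)

def damping(strain):
    """Relative damping, taken as proportional to the product L * dN."""
    return mean_path(strain) * free_dislocations(strain)

strains = [i * 0.005 for i in range(1, 101)]  # 0.5 to 50 per cent cold work
values = [damping(s) for s in strains]
peak = strains[values.index(max(values))]
print(f"toy model: damping maximum near {100 * peak:.1f} per cent cold work")
```

The position of the maximum in such a model depends entirely on the assumed parameters; only the existence of a maximum at an intermediate degree of cold work reflects the argument in the text.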

Amplitude dependent internal friction has only been studied, and its 
mechanism established as one of static hysteresis, at high frequencies. 
No systematic low frequency measurements for well annealed materials 
and for small amounts of cold work have been obtained. (This lack of 
data is partly a result of the fact that low frequency measurements of 
small values of internal friction are difficult, because the equipment 
itself has thus far been responsible for a residual damping of about 10⁻⁴.) 
If the dislocation mechanism just described is correct, however, we 
may anticipate that, given enough time (i.e. at sufficiently low fre- 
quencies) or at higher temperatures of measurement, it should be 
possible for a relatively free dislocation to pull away from its barrier by 
thermal activation, without having to wait for the applied stress to 
become sufficiently large to free it. In other words, the mechanical 
behaviour which is describable in terms of static hysteresis at high 
frequencies may completely change its character to a viscoelastic type 
of mechanism at low frequencies. One model that may be presented is 
that, at low frequencies, instead of dislocations being torn loose from 
impurity atoms that restrain them, they may drag the impurity atoms 
along with them. This behaviour would lead to a viscous creep involving 
a permanent set, since dislocations that have moved have no reason to 
return to their original positions unless a reverse stress is applied. The 
low frequency internal friction may be considerably less amplitude 
dependent than the high frequency damping, and it may show a strong 
frequency dependence, if this interpretation is correct. In fact, the 
simplest description of such behaviour would be the Maxwell solid, 
which represents elasticity plus viscous flow, and leads to an inverse 
frequency dependence (equation (12)). Unfortunately there are no low 
frequency damping measurements which can be examined to see if 
these ideas are correct. There are static measurements, however, which 
agree very well with the above interpretation. These are the
measurements of “microcreep” of single crystals of tin, by CHALMERS, which
show that for very small stresses a limited amount of creep occurs. The
observed initial creep rate is proportional to the applied stress, and the
rate of creep, for any given stress, approaches zero with time. This
microcreep is not recoverable in single crystals. It has been
attributed¹³⁴ to the movement of relatively free dislocations which drag
impurity atoms along with them. On the basis of this evidence it
therefore seems logical to suppose that internal friction at high
frequencies and microcreep may be traced to the same origin: the
movement of relatively free dislocations which are torn loose by the 
stress when the frequency is high, or which surmount the barrier to their 
movement by an activated process at low frequencies or under static 
conditions. Although this description provides a semblance of order in
the interpretation of phenomena associated with the movement of
dislocations at low stress levels and for small degrees of cold work, it
should be remembered that there is a large unfilled gap between the
high frequency measurements and the static “microcreep” measurements.
Furthermore, the former has been studied only in the case of
copper crystals and the latter effect only in the case of tin crystals.
Further experiments are required to see if the predictions are
substantiated.

THERMOELASTIC INTERNAL FRICTION


The sources of internal friction considered up to this point have all
involved some form of atomic re-arrangement, induced by stress. In
the present section, damping originating in stress-induced thermal
variations will be considered. The basic principles involved in this type
of internal friction have been understood for some time and have been
adequately reviewed. The present discussion will briefly summarize
these principles.

By the thermoelastic effect, we mean the coupling between internal
(thermal) and mechanical co-ordinates of a material, as illustrated by
the expansion of a solid when heated, and the reciprocal effect, the
cooling during an adiabatic expansion. An element of material, to
which a small stress is suddenly applied, will instantaneously be
strained by an amount ε′, and this strain will be accompanied by a
change in temperature. If the stress is homogeneous throughout the
specimen, the same temperature change will occur at every point. On
the other hand, a non-homogeneous stress sets up a temperature gradient
in the material, and, therefore, allows heat flow to take place. The flow
of heat into or out of a given element of the material produces an
additional non-elastic strain. In general, a conversion of mechanical
energy into heat, or internal friction, will be a result of this heat flow
and the associated non-elastic behaviour.
If a periodically varying non-homogeneous stress is applied to the
specimen, a periodically varying temperature gradient is produced.
If the variations in the stress are so rapid, i.e. the frequency so high,
that insufficient time is allowed for appreciable heat flow to take place
during a stress cycle, the process is adiabatic and no energy loss, or
damping, takes place. Under such conditions the dynamic modulus is
the modulus M′, the quotient of the stress by the instantaneous strain
(equation (9)). The quantity M′ is, in this case, known as the
adiabatic modulus. In the other extreme, at sufficiently low frequencies,
temperature equilibrium in the specimen is constantly maintained, or
the process is isothermal. The transfer of mechanical energy into heat
takes place reversibly, so that again there is no mechanical
energy lost, i.e. no heat generated, per cycle. The dynamic
modulus measured under these conditions is called the isothermal
modulus and is lower than the adiabatic modulus. In the intermediate
range of frequencies, where the conditions of application of stress are
neither adiabatic nor isothermal, the conversion of mechanical energy
into heat is not reversible and internal friction effects may be observed.
There are two categories of thermoelastic effects, depending on the
origin of the non-homogeneity of stress in the specimen.

Macroscopic Thermal Currents

The thermoelastic effect may result from non-homogeneity of stress that 
is inherent in the type of vibration. This phenomenon is demonstrated 
most strikingly for the case of the transverse vibration (flexure) of 
reeds. In this mode of vibration the material that is on the outer or 
convex side at any instant is expanded and, therefore, cooled, while 
that on the concave side is compressed and raised in temperature. Heat 
flow will, therefore, occur across the specimen, if sufficient time is 
allowed. As already pointed out, the thermoelastic internal friction 
shows a maximum as a function of frequency, since very small damping
occurs at either very low frequencies (the isothermal limit) or at very
high frequencies (the adiabatic limit). It can be shown that the case
of a reed in transverse vibration is describable, very nearly, in terms of a
single time of relaxation (i.e. as a standard linear solid), so that the
internal friction peak is described by equation (39). The time of
relaxation τ is given by

$$\tau = \frac{d^{2}}{\pi^{2} D_t}$$

where d is the thickness of the reed and $D_t$ the thermal diffusion
constant, which is

$$D_t = \frac{\text{thermal conductivity}}{(\text{specific heat})\,(\text{density})}$$

The relaxation strength A, is the fractional difference between the 
adiabatic and isothermal moduli; it ranges between 10-* and 10-* for 
various materials. The measurements of BENNEWITZ and RÖTGER
and of ZENER, OTIS and NUCKOLLS have verified the theory in all
details.
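
As a rough numerical illustration of the relation between reed thickness
and relaxation time, the following Python sketch evaluates the thermal
diffusion constant and τ = d²/π²D_t, and then the frequency at which
ωτ = 1. It assumes that the single-relaxation peak lies at ωτ = 1; the
function names and the material values (approximate room-temperature
figures for copper) are illustrative assumptions, not data from the text.

```python
import math


def thermal_diffusivity(conductivity, specific_heat, density):
    """Thermal diffusion constant D_t = conductivity / (specific heat * density).

    SI units are assumed: W/(m K), J/(kg K) and kg/m^3 give m^2/s.
    """
    return conductivity / (specific_heat * density)


def reed_relaxation_time(thickness, diffusivity):
    """Relaxation time tau = d^2 / (pi^2 * D_t) for a reed of thickness d."""
    return thickness ** 2 / (math.pi ** 2 * diffusivity)


def peak_frequency(tau):
    """Frequency in Hz at which omega * tau = 1 (assumed position of the peak)."""
    return 1.0 / (2.0 * math.pi * tau)


# Approximate, assumed values for copper near room temperature.
D_t = thermal_diffusivity(conductivity=400.0, specific_heat=385.0, density=8960.0)
tau = reed_relaxation_time(thickness=1.0e-3, diffusivity=D_t)  # 1 mm reed
print(f"D_t = {D_t:.3e} m^2/s, tau = {tau:.3e} s, f_peak = {peak_frequency(tau):.1f} Hz")
```

Because τ varies as d², halving the reed thickness raises the peak
frequency fourfold, which is one practical way of moving the
thermoelastic peak out of the frequency range of interest.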

Macroscopic thermal currents may also occur, at least in principle,
in a specimen vibrating in a longitudinal mode, where heat flow is to be
expected between regions of compression and of dilatation. The
distances between these regions, and, therefore, the time of relaxation
involved, are too large for the effect to be of any significance when
specimens are vibrated in the ordinary range of frequencies. We may,
therefore, always regard longitudinal vibrations as adiabatic with
respect to macroscopic thermal currents.

No thermoelastic effect results from a non-homogeneous shear stress,
as in torsional vibration, because shearing is not accompanied by a
change of temperature.


Microscopic Thermal Currents 


In addition to the possibility that non-homogeneity of stress may
result from the type of vibration employed, a polycrystalline metal also
offers the possibility of fluctuations in stress from grain to grain, even
when the applied stress is macroscopically homogeneous. This
microscopic non-homogeneity is a consequence of the elastic anisotropy of an
individual grain, combined with the fact that grains are randomly
oriented in a polycrystalline metal. Fluctuations in stress from one
grain to another give rise to heat currents between adjacent grains, and,
therefore, to an internal friction peak analogous to that obtained as a
result of macroscopic thermal currents. The peak obtained as a result
of intergranular thermal currents does not represent a single time of
relaxation, however, and the exact form of this curve is not readily
evaluated theoretically. We may expect, from dimensional reasoning,
that internal friction will be a function of the frequency, the mean grain
size $d_g$, and the thermal diffusion constant $D_t$ only through the product
$\omega d_g^{2}/D_t$. The maximum in the internal friction may be expected when
this product is close to unity. These predictions are verified by the
experiments of ZENER and RANDALL.
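
The dimensional argument can be turned into an order-of-magnitude
estimate of where the intergranular peak falls. The sketch below assumes
that the controlling dimensionless product is ω·d²/D (angular frequency,
mean grain size, thermal diffusion constant) and that the maximum occurs
where this product equals a chosen value near unity; the function name
and the parameter `product_at_peak` are hypothetical.

```python
import math


def intergranular_peak_frequency(grain_size, diffusivity, product_at_peak=1.0):
    """Frequency in Hz at which omega * d^2 / D equals product_at_peak.

    grain_size in m, diffusivity in m^2/s. This is an order-of-magnitude
    estimate only: the text states merely that the product is close to unity.
    """
    omega = product_at_peak * diffusivity / grain_size ** 2
    return omega / (2.0 * math.pi)


# Assumed illustrative values: D ~ 1e-4 m^2/s, grain sizes of 0.1 mm and 1 mm.
for d in (1.0e-4, 1.0e-3):
    f = intergranular_peak_frequency(d, 1.0e-4)
    print(f"grain size {d:.0e} m -> peak near {f:.3g} Hz")
```

The inverse-square dependence on grain size means that a tenfold
coarsening of the grains lowers the estimated peak frequency a
hundredfold.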

An awareness of the internal friction that originates in thermal
currents is important when one studies the damping due to other
sources, if polycrystalline materials are being used, or if the specimen
is vibrated transversely. Under either of these circumstances, it is
desirable to work in a frequency range where the vibrations are either
isothermal or adiabatic with respect to thermal currents, to avoid this
type of internal friction.


MAGNETOELASTIC INTERNAL FRICTION 


The coupling between magnetic and mechanical properties of a
ferromagnetic material provides another important source of internal
friction. This “magnetic damping” is closely related to other ferromagnetic
phenomena and has, therefore, been reviewed in some detail in the
general treatises on ferromagnetism by BECKER and DÖRING and by
BOZORTH. The present discussion is a brief résumé of the factors that
are responsible for internal friction in ferromagnetic materials.

Magnetoelastic phenomena are best discussed in terms of the domain 
theory of ferromagnetism, according to which a ferromagnetic substance 
is composed of small “domains” that are individually magnetized to 
saturation. In the demagnetized state of a specimen, there is a random 
distribution of the magnetization vectors of the domains among the 
various crystallographic directions of “easy magnetization”. The effect
of a magnetic field is to produce rotation of the directions of magnetiza- 
tion and movements of domain boundaries so as to align the magnetiza- 
tion vectors as closely as possible to the direction of the field. 

The coupling between magnetic and mechanical properties is provided 
by the phenomenon of magnetostriction, which usually manifests itself 
through the elongation of a material in the direction of magnetization. 
(The opposite effect, a decrease in length of a magnetized specimen,
represents “negative magnetostriction”.) From the appropriate
reciprocity relation, it follows that application of stress may produce a 
change in the state of magnetization. The effect of stress is, therefore, 
somewhat analogous to that of a magnetic field, in that stress can also 
produce rotation of the magnetization vectors of domains and movement 
of the domain boundaries. The principal difference between the action 
of stress and of a magnetic field is that in a field there is a difference in 
the energy associated with a magnetization parallel and antiparallel to 
the field, while stress is incapable of distinguishing between two 
directions of magnetization 180° apart. 

As a result of magnetoelastic coupling, the effect of stress is to
produce, in addition to the elastic strain, a magnetostrictive strain. The
occurrence of this non-elastic strain leads both to internal friction and
to what is called, in the present case, the ΔE effect. At high fields,
sufficient to align all domains in the direction of the field, the
magnetostrictive strain vanishes. The value of Young’s modulus measured
under these conditions, i.e. when the material is magnetized to
saturation, is simply the ratio of stress to elastic strain ε′ and may be
designated in the usual notation (see equation (9)) as E′. The difference
between E′ and the value of Young’s modulus measured at zero
magnetization is the well-known “ΔE effect.” The concept of a ΔM
effect, which appears throughout this article, is a natural generalization
of this ΔE effect in magnetic materials. We will see shortly that
Young’s modulus of a ferromagnetic specimen depends not only on the
field strength but also on the vibration amplitude.

The work of several authors¹⁴¹ has led to the separation of
sources of internal friction in magnetic materials into three types: 
macro-eddy currents, micro-eddy currents, and static hysteresis. These 
will now be discussed individually. 


Macro-eddy Currents 

The sudden application of a tensile stress to a partially magnetized 
specimen will produce a change in the magnetization. Such a change 
induces surface eddy currents which produce an additional field inside 
the specimen such that the total flux inside is unchanged. Associated 
with these currents are resistive energy losses. The surface eddy 
currents gradually diffuse inwards thereby allowing the magnetic field 
strength in the specimen to return to its original value. This diffusion 
of eddy currents, and, therefore, of magnetic flux, may be regarded as
analogous to the thermal diffusion which produces thermoelastic internal
friction. It results in a peak in the curve of internal friction versus
frequency; this peak is not expressible as a single time of relaxation,
however. The damping depends on the thickness of the specimen d, the
resistivity ρ, and the angular frequency ω only through the quantity
d²ω/ρ. In the low frequency limit, damping is proportional to this
quantity, and in the high frequency limit, the damping varies inversely
as the square root of this quantity. This type of internal friction is a
typical example of anelasticity, and shows the usual independence of
vibration amplitude. There is no damping due to macro-eddy currents
in a demagnetized specimen, because the application of stress to
such a specimen produces no gross magnetization. (This fact is a
consequence of the inability of stress to distinguish between two 
directions 180° apart; if no preferential direction of magnetization 
exists at zero stress, a finite stress cannot create one.) These charac- 
teristics (the peak obtained as a function of frequency, the amplitude 
independence, and the vanishing of the effect in a demagnetized 
specimen) distinguish the internal friction due to macro-eddy currents 
and make it possible to separate this type of damping from the others. 


Micro-eddy Currents 


Although the application of stress to a demagnetized specimen produces 
no gross magnetization, the fact that there are local variations of 
magnetization within the material, due to the domain structure, means 
that stress will produce local changes of magnetization. These local 
magnetization changes will add vectorially to zero, but the “micro-eddy
currents” that accompany them will be associated with finite total
energy losses. Like the macro-eddy current case, the damping that
results from micro-eddy currents is independent of vibration amplitude and
strongly frequency dependent. The two cases differ, however, in that
micro-eddy currents produce losses even at zero magnetization, and that
these losses are proportional to the frequency over a wide range of
frequency, i.e. internal friction does not go through a peak.
Furthermore, this source of damping is highly structure sensitive; according to
a calculation of BECKER and DÖRING, internal friction varies inversely
as the square of the magnitude of internal stresses. In qualitative
agreement with this theory, various authors show that this type
of damping is reduced by cold working and increased by annealing.


Magnetoelastic Static Hysteresis 

When measurements are taken at sufficiently low frequencies, eddy 
current losses are effectively eliminated, because the rate of application 
of stress and, therefore, the rate of change of magnetization is very 
small. Under these conditions we obtain a stress-strain hysteresis loop
analogous to the hysteresis loop of magnetization versus field strength;
in both cases the occurrence of hysteresis is associated with sudden
movements of domain boundaries which produce, irreversibly, a change
in the size of one domain at the expense of another. This type of
stress-strain loop is another example of the behaviour that we have termed
static hysteresis. Characteristic of such behaviour is frequency
independence, and a strong dependence of damping on strain amplitude even
at low amplitudes of vibration. This type of magnetic damping is
discussed in detail by BOULANGER.

The similarity in the origin of the magnetization-field strength 
hysteresis loop to that of the stress-strain loop means that the Rayleigh 
law, which states that magnetic hysteresis losses are proportional to the 
cube of the maximum field strength, applies equally well to the
mechanical hysteresis losses. The energy dissipated per cycle should, then, be
proportional to σ₀³, where σ₀ is the stress amplitude of vibration. Since
the vibrational energy is proportional to σ₀², we find (from equation
(27)) that the internal friction is proportional to σ₀. This linear
amplitude dependence of damping at low amplitudes has been verified by
FÖRSTER and KÖSTER and by SNOEK. Correspondingly, it is shown
that Young’s modulus decreases linearly with σ₀. This result again
shows that for a mechanism of static hysteresis, both the damping and
the ΔM effect show the same form of amplitude dependence, as expected
from equations (49). As the vibration amplitude continues to increase,
the energy dissipated per cycle must deviate from Rayleigh’s law and,
finally, approach a constant value for stress amplitudes sufficiently
large to produce a condition of saturation. Correspondingly, the
internal friction, which initially obeys a first power law, must go
through a maximum as a function of amplitude and, finally, obey an
inverse square law. The measurements of BOULANGER, shown in Fig. 11,
demonstrate this behaviour. From the difference between the internal
friction curves for zero field and for saturation, the energy loss ΔW
due to ferromagnetic effects can be calculated (the dashed curve of
Fig. 11). This curve is initially a cubic law and eventually approaches a
constant value at large amplitudes. The fact that essentially the same
internal friction curve is obtained at different frequencies guarantees
that eddy current losses do not contribute to the observed damping.

Fig. 11. Internal friction of Armco iron as a function of amplitude
of vibration, at zero field and at saturation. Points indicated by
different symbols represent different vibration frequencies.
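
The amplitude dependence described above (first power law, a maximum,
then an inverse square law) can be reproduced with a very simple
numerical sketch. The interpolation formula used for the energy loss is
an assumption chosen only because it is cubic at small amplitude and
constant at large amplitude; it is not taken from the measurements, and
the reduced amplitude `s` and both function names are hypothetical.

```python
def energy_loss(s):
    """Assumed hysteresis loss per cycle in reduced units.

    s is the stress amplitude divided by a saturation amplitude.
    The form s^3 / (1 + s^3) is cubic for s << 1 (Rayleigh region)
    and tends to a constant for s >> 1 (saturation).
    """
    return s ** 3 / (1.0 + s ** 3)


def internal_friction(s):
    """Loss per cycle divided by vibrational energy (taken as s^2), up to a constant."""
    return energy_loss(s) / s ** 2


# Locate the maximum on a coarse grid of reduced amplitudes.
grid = [0.01 * i for i in range(1, 1001)]
s_peak = max(grid, key=internal_friction)
print(f"maximum of the model curve near s = {s_peak:.2f}")
for s in (1e-3, 2e-3, 1e3, 2e3):
    print(f"s = {s:g}: internal friction = {internal_friction(s):.3e}")
```

In this model doubling a small amplitude doubles the damping, while
doubling a large amplitude reduces it to one quarter, which is the
qualitative behaviour stated for Fig. 11.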


OTHER SOURCES OF INTERNAL FRICTION


The sources of damping already discussed by no means exhaust all 
possibilities, but are simply the ones that have been most thoroughly 
investigated up to the present. The ability of stress to induce atomic 
re-arrangements of various kinds seems to offer wide opportunities for 
further application of internal friction methods, particularly in the 
study of phase transformations. Such applications have, thus far, been 
only of a preliminary nature. 

An important type of phase transformation is the precipitation of a 
second phase, which occurs by a process of nucleation and growth. We 
have already discussed the effect of precipitation on internal friction 
when precipitation is discontinuous and results in the formation of 
internal interfaces. Studies of the precipitation of interstitial solutes in 
b.c.c. lattices, in terms of the rate of disappearance of solute atoms from 
solid solution, have also been discussed. No mention has been made, 
however, of any internal friction produced by the precipitate particles 
themselves, in the early stages of continuous precipitation. Such effects 
have been studied, for the most part, on age hardenable engineering 
materials; internal friction, as measured at one frequency and at room 
temperature, may then be regarded as another mechanical property 
(like hardness) which varies with the degree of precipitation. On the 
other hand, fundamental studies of such damping effects, in relatively 
simple alloys, may reveal valuable information about the nature of the 
early stages of precipitation, that cannot be obtained by other methods. 
An investigation by KÊ reveals an “anomalous” internal friction in
aluminium containing 4 per cent copper, which he claims is related to 
the precipitation process and to the presence of a state of cold work. 
Further experiments by the writer on this alloy indicate that cold 
working may not be necessary to this effect, and also show that the 
“anomalous” damping is difficult to reproduce. This phenomenon 
seems to be a sensitive measure of one stage in the precipitation 
process. 

Internal friction methods may also be of value in the case of phase 
transformations that do not occur by nucleation and growth. If the
transformation is accompanied by a change of dimensions, the
application of stress to the material will induce or retard the transformation at
temperatures in the immediate vicinity of the transformation
temperature. Stated differently, the lattice is relatively unstable near the
transformation temperature and may be greatly influenced by an
external stress. Large non-elastic deformation, or plasticity, and large
internal friction effects are, therefore, expected in the vicinity of the
transformation temperature. Such behaviour has been observed for the
β to ζ transition in Ag-Zn alloys, for the λ-point transition in
ammonium chloride, and for the α to γ transformation in ferronickels.
The mechanism of this type of internal friction is, of course, closely 
related to the mechanism of the particular phase transformation, and, 
therefore, deserves further detailed investigation.

THE MECHANISM OF OXIDATION OF
METALS AND ALLOYS AT HIGH
TEMPERATURES


Karl Hauffe* 


INTRODUCTION 


THE reaction between a solid metal and oxygen or an oxidizing gas 
(e.g. a halogen or sulphur) seems to be one of the simplest chemical 
reactions. Experiments show, however, that the mechanism is rather 
complex, because in many cases the reaction product separates the 
reactants and forms a compact layer of oxide or halide on the metal 
surface. The oxidation can advance only as fast as the reactants diffuse 
across the oxide layer. It is therefore of interest to investigate the 
structure of these layers and the mechanism whereby atoms, ions and 
electrons diffuse in them. Assuming that there is no diffusion through 
pores or along grain boundaries, WAGNER made the working
hypothesis that only cations and anions migrate across the compact layer,
and that the migration of electrically neutral atoms or molecules may
be neglected. In the next chapter we shall consider the laws of motion
of ions and electrons within the crystal. We shall see that WAGNER’s
theory of tarnishing reactions and its applications to alloys by
WAGNER and by HAUFFE and associates have had some success in
explaining experimental observations.

The interconnexion of diffusion processes and phase boundary 
reactions is pictured in Fig. 1. Sometimes, the reactions at the phase 
boundaries may be rate-determining factors. In most cases, however, 
the migration of ions controls the rate of oxidation. On this basis it may 
be assumed either that metal ions and electrons migrate from the inter- 
face between metal and oxide across the oxide to the outer surface or 
that oxygen ions migrate in the opposite direction. 

Reactions of metals and alloys with oxygen, sulphur and halogens at 
high temperatures are called tarnishing reactions. We must distinguish 
between oxidation processes leading to thick oxide layers and those 
leading to thin (or very thin) films. In the first case it is to be expected 
that the rate of oxidation is inversely proportional to the thickness of 
the oxide layer (so that the thickness grows in proportion to the square
root of the time) and that electroneutrality holds. Under these
conditions WAGNER’s theory is valid. At the other extreme, in very thin
films, there is a diffuse electric double layer, where the electrical field
strength is so high that the rate of migration of ions is no longer
proportional to the field strength, i.e. it is no longer inversely
proportional to the thickness of the oxide film, but depends
quasi-exponentially upon the thickness. MOTT was the first to show that
under these conditions the rate of oxidation will decrease more rapidly
with increasing thickness of the oxide film than is accounted for by the
parabolic law. This is one of the various ways of interpreting the
so-called logarithmic rate law. In Chapter 3.1 the formation of thin
films will be discussed in general, using the theories of CABRERA and
MOTT and ENGELL and HAUFFE.

* At present Professor of Physical Chemistry at the University of Berlin, Germany.

Fig. 1. Migration processes and phase boundary reactions during the
oxidation of a metal forming a defect semi-conducting oxide layer.
(Here ⊕ and Me□′ represent an electron hole and a metal ion vacancy,
respectively. The expression “electron hole” indicates that somewhere
in the lattice of the compound an electron is missing. Fig. 1 shows that
electrons migrate by filling electron holes and ions migrate by filling
ion vacancies.)

For thick layers, where the concentrations or activities of the
reactants at the two boundaries of the oxide layer are independent of
time, and thermodynamic equilibrium at the respective interfaces is
practically established, the reaction rate obeys Fick’s law, giving the
increase of thickness Δx per unit time t,

$$\frac{d(\Delta x)}{dt} = \frac{k'}{\Delta x} \qquad (1)$$

where k′ is a constant. Upon integration, with Δx = 0 at t = 0,

$$(\Delta x)^{2} = 2k't \qquad (2)$$

Instead of measuring the thickness it is often easier to determine the
increase of mass in grammes per cm², Δm/A. Equation (2) then leads to

$$\left(\frac{\Delta m}{A}\right)^{2} = 2k''t \qquad (3)$$

with k″ the corresponding rate constant expressed in mass units.
This is the parabolic rate law, which was first derived by TAMMANN in
1920 and independently by PILLING and BEDWORTH in 1923.
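
The step from the rate equation to the parabolic law can be checked
numerically. The sketch below integrates d(Δx)/dt = k′/Δx with a simple
midpoint scheme and compares the result with the closed form
Δx = (x₀² + 2k′t)^½. The small starting thickness x₀ (needed to avoid
division by zero), the step count and the value of k′ are assumptions
made purely for illustration.

```python
import math


def parabolic_thickness(k_prime, t, x0=0.0):
    """Closed-form solution of d(dx)/dt = k'/dx with thickness x0 at t = 0."""
    return math.sqrt(x0 ** 2 + 2.0 * k_prime * t)


def integrate_rate_law(k_prime, t_end, x0, steps=100_000):
    """Midpoint-rule integration of d(dx)/dt = k'/dx starting from x0 > 0."""
    dt = t_end / steps
    x = x0
    for _ in range(steps):
        x_mid = x + 0.5 * dt * k_prime / x   # half step to estimate the midpoint
        x += dt * k_prime / x_mid            # full step using the midpoint rate
    return x


# Assumed illustrative numbers: k' = 1e-12 m^2/s, one hour, 0.1 micron start.
k_prime, t_end, x0 = 1.0e-12, 3600.0, 1.0e-7
numeric = integrate_rate_law(k_prime, t_end, x0)
analytic = parabolic_thickness(k_prime, t_end, x0)
print(f"numeric = {numeric:.4e} m, analytic = {analytic:.4e} m")
```

A consequence worth keeping in mind is that doubling the layer thickness
takes four times as long, since Δx grows only as the square root of the
time.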

THE FORMATION OF THICK COMPACT LAYERS OF OXIDE
ON A METAL SURFACE

A compact layer is formed, if the molar volume of the oxide is equal 
to or greater than that of the metal. This general rule is only a rough 
one and requires modifying in view of experimental results. The 
theoretical basis for understanding the departure from this rule has 
been given by FRANK and VAN DER MERWE. According to these
authors it is reasonable to assume that, if the degree of misfit is less 
than about 15 per cent, the layer will take up the lattice parameter of 
the metal. If it is, however, greater than about 15 per cent, it will take 
up very nearly its own unstrained lattice parameter. In the latter 
case the layer can break away from the surface or become porous. 
In the first part of this report we shall confine ourselves to thick 
compact layers. In 4-3 thin layers will be dealt with. 


Lattice Defects in Ionic Crystals 


Motion of ions in a crystal lattice is understandable, if one assumes, as
FRENKEL has done, that not all ions are in their normal lattice positions.
In pure silver halides some cations have left their normal lattice points
to occupy interstitial positions; in this way one cation vacancy appears
per interstitial ion, as is shown in Fig. 2. The anion lattice is
practically undisturbed. In a concentration gradient or a weak
electrical field the silver ions can move via interstitial positions or via
vacancies. FRENKEL’s assumptions well explain experimental results.
TUBANDT showed that, in silver halides, practically only silver ions are
mobile. Furthermore, energy calculations led JOST to the conclusion that
for the silver halides less energy is required to move a silver ion from a
normal to an interstitial site than to form a pair of vacancies (proposed
by SCHOTTKY for alkali halides and discussed below) by moving a cation
and an anion to the surface. More detailed calculations by MOTT and
LITTLETON somewhat modified this. Experiments involving the
addition of the halides of divalent metals to silver halides, first carried
out by KOCH and WAGNER, have confirmed JOST’s viewpoint.
TELTOW has repeated these experiments with a considerable increase
in experimental accuracy and also in the number of determinations
with additions of divalent metal salts such as the halides of lead and
cadmium.

Fig. 2. Lattice defects in silver bromide. According to FRENKEL.
(Vacant silver sites indicated by a square)

Fig. 3 shows another type of defect, first discussed by Scnotrky.” 
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Under the influence of thermal motion cations and anions near the 
surface of the alkali halides leave their normal lattice points diffusing 
to the surface and leaving vacancies behind. These vacancies can 
move in the lattice and their existence thus makes the normal ions more 
mobile. Experiments to determine transport numbers show that the 
conductivity is entirely ionic in character and due to the migration of 
both cations and anions. The temperature 
coefficient of mobility will usually be dif- 
ferent for the two ions. The energy required 
to form a vacancy was first computed by 
Jost and NEHLEP*® and later investigated 
wie & Lattice defects tn by more refined methods by Morr and 
sodium chloride. Accord. LITTLETON. ScHotrkKy defects seem to be 
ing to SCHOTTKY the rule also in some other crystals, e.g. 
in lead halides and some sulphides and oxides. 
These types of lattice defects are only to be found in stoichiometric 
crystals. The WacNner-Scuotrky disorder theory of crystals explains 
experimental observations and, moreover, its application led to further 
fruitful experiments. Thus WaGNeER and co-workers*' have developed 
the basic principles of ionic crystals possessing deviations from the ideal, 
integral, stoichiometric composition. In solids like Cu,O there are 
cation vacancies Cul)’ and electron holes ©. From the corpuscular 
viewpoint, therefore, an electron hole is represented by a Cu** ion (see 
Fig. 4). In the presence of oxygen, cuprous ions diffuse from the interior 
to the surface, there combining with chemisorbed oxygen which, on 
account of its electron affinity, has 2, o¢ 
captured electrons, forming more Cu,O 
molecules at the surface. It is, there- 
fore, evident that the number of Cu’- 
vacancies and electron holes increases 
with increasing oxygen pressure. Since pig 4. Lattice defects in 
in most cases the concentration of cuprous oxide. According to 
lattice defects is relatively low, one WAGNER. Squares indicate 
may tentatively apply the laws of vacant copper ion lattice pointe. 
: , A divalent copper ion repre- 
ideal solutions. Under these assump- sentine 6th dhnatinats, tale 
tions we can write 
$$\tfrac{1}{2}\,\mathrm{O_2} = \mathrm{Cu_2O} + 2\,\mathrm{Cu}\square' + 2\,\oplus \tag{4}$$

Applying the law of mass action we have

$$c_{\oplus} = K \cdot p_{\mathrm{O_2}}^{1/8} \tag{5}$$

where $K$ is the equilibrium constant and $c_{\oplus}$ is the concentration
of electron holes, which is equal to the concentration of cuprous ion
vacancies $c_{\mathrm{Cu}\square'}$. Since the electrical conductivity
$\kappa$ is proportional to $c_{\oplus}$, we have

$$\kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{1/8} \tag{6}$$

WAGNER and associates find $\kappa \propto p_{\mathrm{O_2}}^{1/7}$, in good
agreement with theory. GUNDERMANN and WAGNER showed that the conductivity
due to cuprous ions is also proportional to $p_{\mathrm{O_2}}^{1/7}$. NiO
behaves in a similar manner. Semi-conductors in which the current is
carried by electron holes are called electron-defect semi-conductors (or
p-semi-conductors). Such conductors are Cu₂O, NiO, FeO, CoO, Bi₂O₃ and
Cr₂O₃.
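
The exponent in a pressure law of this kind follows from simple bookkeeping of the defect reaction, which may be sketched in Python as follows. The function name `pressure_exponent` and its arguments are illustrative assumptions and not part of WAGNER's treatment; the sketch assumes the ideal law of mass action and that electro-neutrality fixes the ratio of ionic to electronic defects, so that both concentrations vary with the same power of the pressure.

```python
from fractions import Fraction


def pressure_exponent(n_ionic, n_electronic, gas_coeff=Fraction(1, 2)):
    """Exponent m in c ~ p**m for an idealized defect reaction

        gas_coeff * X2(gas) = lattice + n_ionic * (ionic defect)
                                      + n_electronic * (electronic defect)

    Mass action gives c_ionic**n_ionic * c_el**n_electronic = K * p**gas_coeff.
    With c_ionic proportional to c_el (electro-neutrality), every defect
    concentration scales as p**(gas_coeff / (n_ionic + n_electronic)).
    A negative gas_coeff describes a reaction that releases the gas.
    """
    return Fraction(gas_coeff) / (n_ionic + n_electronic)


# Cu2O: half an O2 molecule yields two cation vacancies and two holes.
cu2o_exponent = pressure_exponent(2, 2)
# An oxide forming one doubly charged vacancy and two holes per half O2.
divalent_exponent = pressure_exponent(1, 2)
```

For the cuprous oxide reaction the sketch returns the exponent 1/8; for a reaction giving one doubly charged vacancy and two holes it returns 1/6.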

Solids containing an excess of metal are in distinct contrast to the
above group of oxides. A representative of this group is zinc oxide.
An excess of metal may be present in the form of zinc ions in interstitial
positions and an equivalent number of quasi-free electrons $\ominus$ as
shown in Fig. 5. At high temperatures ZnO dissociates, as follows

$$\begin{aligned}
\mathrm{ZnO} &= \mathrm{Zn}\circ^{\cdot} + \ominus + \tfrac{1}{2}\,\mathrm{O_2} \\
\mathrm{ZnO} &= \mathrm{Zn}\circ^{\cdot\cdot} + 2\,\ominus + \tfrac{1}{2}\,\mathrm{O_2}
\end{aligned} \tag{7}$$

In these equations $\mathrm{Zn}\circ^{\cdot}$ and $\mathrm{Zn}\circ^{\cdot\cdot}$,
respectively, represent an interstitial zinc ion carrying one and two
positive charges, the points to the right of the circle indicating the
positive net-charge. According to MILLER's self-diffusion measurements
at 1000°C the assumption of the existence of interstitial zinc is better
justified than is the existence of oxygen ion vacancies.

Fig. 5. Tentative scheme of lattice defects in zinc oxide as proposed
by WAGNER

Again applying the law of mass action we obtain for the electrical
conductivity

$$\kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{-1/4}
\quad\text{and}\quad
\kappa = \mathrm{const} \cdot p_{\mathrm{O_2}}^{-1/6},
\quad\text{respectively} \tag{8}$$
In agreement with (8) it is found experimentally that the electrical 
conductivity of zinc oxide decreases with increasing partial pressure of 
oxygen at 600°C?; «x apo ~'* to po’. CdO,? TiO,,% * Al,0,” 
behave similarly. The special problems encountered in interpreting 
the observed properties of intrinsic semi-conductors such as CuO* will 
not be discussed, because it is not yet known what type of lattice 
defects prevail in this case. 
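
The two exponents quoted for zinc oxide can be traced back to the dissociation equilibria in a few lines. The following is a sketch under the assumptions of ideal mass action and of electro-neutrality between interstitial zinc ions and quasi-free electrons only; the symbols $K_1$ and $K_2$ for the two equilibrium constants are introduced here for illustration.

```latex
% Singly charged interstitial zinc:  ZnO = Zn(interstitial)^. + e + 1/2 O2
c_{\mathrm{Zn}\circ^{\cdot}}\; c_{\ominus}\; p_{\mathrm{O_2}}^{1/2} = K_1 ,
\qquad c_{\mathrm{Zn}\circ^{\cdot}} = c_{\ominus}
\;\Longrightarrow\;
c_{\ominus} = K_1^{1/2}\, p_{\mathrm{O_2}}^{-1/4}

% Doubly charged interstitial zinc:  ZnO = Zn(interstitial)^.. + 2 e + 1/2 O2
c_{\mathrm{Zn}\circ^{\cdot\cdot}}\; c_{\ominus}^{2}\; p_{\mathrm{O_2}}^{1/2} = K_2 ,
\qquad 2\,c_{\mathrm{Zn}\circ^{\cdot\cdot}} = c_{\ominus}
\;\Longrightarrow\;
c_{\ominus} = (2K_2)^{1/3}\, p_{\mathrm{O_2}}^{-1/6}
```

Since the conductivity is proportional to the concentration of quasi-free electrons, it decreases with the fourth or the sixth root of the oxygen pressure according to which of the two equilibria prevails.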


Wagner's Theory of Oxidation of Metals 


Following WAGNER we shall now derive the general formula of high-
temperature oxidation of metals, which expresses the rate of oxidation
in terms of the free energy of oxidation, the electrical conductivity,
and the transport numbers of ions and electrons. Assuming that the
ions and electrons migrate independently of each other, one obtains
for $u_i$, the average drift velocity in cm s⁻¹ of particles of type $i$
in the direction of the x-axis

$$u_i = -B_i\left(\frac{1}{N}\,\frac{d\mu_i}{dx} + z_i e\,\frac{d\varphi}{dx}\right) \tag{9}$$

where $B_i$ represents the mobility of particles of type $i$, i.e. the
steady state velocity due to a unit force per particle, corresponding to
a change in the chemical potential per particle of one unit of energy
(1 erg) for a displacement of one unit of length (1 cm). $N$ is
Avogadro's number, $e$ is the electronic charge, $\varphi$ is the local
electrical potential, $z_i$ is the electrical valency and $\mu_i$ is the
chemical potential of a mole of particles of type $i$ in ergs per mole.

If $\dot n_i/A$ is the number of moles migrating per unit cross section
per unit time, then there follows from equation (9)

$$\frac{\dot n_i}{A} = -c_i B_i\left(\frac{1}{N}\,\frac{d\mu_i}{dx} + z_i e\,\frac{d\varphi}{dx}\right) \tag{10}$$

where $c_i$ is the concentration of particles of type $i$ in moles per
unit volume (1 cm³).

As it is often desirable to eliminate $B_i$ we introduce the electric
current $I$ (in amperes) and the electric conductivity $\kappa$
(ohm⁻¹ cm⁻¹) in combination with the electric field strength
$(d\varphi/dx)$ in absolute units and the factor 300 to convert to
practical units by means of the relation

$$\frac{I}{A\,\kappa\,300} = -\frac{d\varphi}{dx} \tag{11}$$

In the case of electrolysis $d\mu_i/dx$ is zero, and we obtain for the
transport of ions

$$\frac{\dot n_i}{A} = -\frac{300}{96500}\,\frac{t_i\,\kappa}{z_i}\,\frac{d\varphi}{dx} \tag{12}$$

where $t_i$ is the transport number of particles of type $i$. Comparing
equations (10) and (12) we get

$$\frac{\dot n_i}{A} = -\frac{300}{96500}\,\frac{t_i\,\kappa}{z_i^{2}\,e\,N}\left(\frac{d\mu_i}{dx} + z_i e N\,\frac{d\varphi}{dx}\right) \tag{13}$$

In view of the chemical equilibrium between ions, electrons and
electrically neutral atoms, we have

$$\mu_{Me} = \mu_1 + z_1\,\mu_3 \tag{14a}$$

$$\mu_{X} = \mu_2 - |z_2|\,\mu_3 \tag{14b}$$

where the indices 1, 2 and 3 refer to metal ions, non-metal ions and
electrons, and $\mu_{Me}$ and $\mu_{X}$ are the chemical potentials of
the electrically neutral reactants, the metal Me and the non-metal X,
respectively. These chemical potentials $\mu_{Me}$ and $\mu_{X}$ are not
independent of each other. According to the special form of the
Gibbs-Duhem equation,

$$n_{Me}\,d\mu_{Me} + n_{X}\,d\mu_{X} = 0,$$

we have in the case of a small deviation from a stoichiometrically
composed solid

$$d\mu_{Me} \cong -\frac{z_1}{|z_2|}\,d\mu_{X} \tag{15}$$
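
The step from the Gibbs-Duhem equation to the relation between the two chemical potentials rests only on the composition of the compound. As a brief sketch, assuming a nearly stoichiometric compound of formula $Me_{|z_2|}X_{z_1}$ (the formula and the copper example are added here for illustration):

```latex
% Mole numbers in Me_{|z_2|} X_{z_1} stand in the ratio of the valencies:
\frac{n_{Me}}{n_{X}} = \frac{|z_2|}{z_1}
\quad\Longrightarrow\quad
d\mu_{Me} = -\frac{n_{X}}{n_{Me}}\,d\mu_{X} = -\frac{z_1}{|z_2|}\,d\mu_{X}

% Example: cuprous oxide, z_1 = 1 and |z_2| = 2
d\mu_{Cu} = -\tfrac{1}{2}\,d\mu_{O}
```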


On the basis of these equations WAGNER derives the following
equation for the formation of an oxide, halide or sulphide

$$\frac{\dot n_{equ}}{A} = \left\{\frac{300}{96500}\,\frac{1}{N e}\int_{\mu_X'}^{\mu_X''}(t_1 + t_2)\,t_3\,\kappa\,\frac{1}{|z_2|}\,d\mu_X\right\}\frac{1}{\Delta x} = \frac{k_r}{\Delta x} \tag{16}$$

where $\dot n_{equ}/A$ is the rate of both outward migration of metal ions
and inward migration of non-metal ions expressed in equivalents per
square centimetre per second, $\Delta x$ is the instantaneous thickness
of the reaction product in centimetres, $e$ is the electronic charge in
absolute electrostatic units, $z_2$ is the valence of the negatively
charged anions, $\mu_X$ is the chemical potential of the non-metal in ergs
per gram atom, where a single dash and the double dash following the
$\mu_X$ indicate the equilibrium between the reaction product and the
metal and between the reaction product and the non-metal, respectively;
$t_1$, $t_2$ and $t_3$ are the transport numbers of metal ions, non-metal
ions and electrons, respectively, and the constant $k_r$ is called the
"rational rate constant," i.e. the reaction rate in equivalents per
centimetre per second for a layer of the reaction product of 1 cm
thickness.
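
The integral defining the rational rate constant can be evaluated numerically once the conductivity is known as a function of the pressure of the non-metal. The following Python sketch is one way of doing so and is not WAGNER's own procedure. It assumes a diatomic gas X₂, so that the chemical potential of X changes by ½RT d ln p, and it works in practical units, in which the factor 300/(96500 Ne) applied to a chemical potential in ergs per mole becomes 1/F² applied to joules per mole. The function name, the argument `effective_conductivity` (standing for the product (t₁ + t₂)·t₃·κ) and all numerical values in the usage lines are hypothetical.

```python
import math

R = 8.314462618   # gas constant, J mol^-1 K^-1
F = 96485.33212   # Faraday constant, C per equivalent


def rational_rate_constant(effective_conductivity, z_anion,
                           p_inner, p_outer, temperature, steps=2000):
    """Rational rate constant in equivalents cm^-1 s^-1.

    effective_conductivity(p) returns (t1 + t2) * t3 * kappa in ohm^-1 cm^-1
    at the pressure p (atm) of the diatomic gas X2.  The integral over the
    chemical potential of X is taken as (RT/2) * integral over ln p and is
    evaluated with the composite trapezoidal rule.
    """
    a, b = math.log(p_inner), math.log(p_outer)
    h = (b - a) / steps
    total = 0.5 * (effective_conductivity(p_inner)
                   + effective_conductivity(p_outer))
    for i in range(1, steps):
        total += effective_conductivity(math.exp(a + i * h))
    integral = total * h
    return R * temperature / F**2 * integral / (2.0 * abs(z_anion))


# Hypothetical usage: a hole conductivity rising with the square root of p.
kappa_unit_pressure = 1.0e-4   # ohm^-1 cm^-1 at p = 1 atm (invented value)
k_r = rational_rate_constant(lambda p: kappa_unit_pressure * math.sqrt(p),
                             z_anion=1, p_inner=1.0e-6, p_outer=0.23,
                             temperature=473.15)
```

For a conductivity that follows a simple power law the integral can also be done in closed form, which gives a convenient check on the numerical result.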

Recently WAGNER has derived a formula, similar to equation (16),
which contains only the self-diffusion coefficients $D_1^*$ and $D_2^*$ of
metal and non-metal ions, respectively. Using EINSTEIN's equation,

$$D_i^* = B_i\,k\,T \tag{17}$$

WAGNER finally obtains the following expression for the rational rate
constant $k_r$

$$k_r = c\int_{a_X'}^{a_X''}\left(\frac{z_1}{|z_2|}\,D_1^* + D_2^*\right)d\ln a_X
     = c\int_{a_{Me}''}^{a_{Me}'}\left(D_1^* + \frac{|z_2|}{z_1}\,D_2^*\right)d\ln a_{Me} \tag{18}$$

where $c = z_1 c_1 = |z_2|\,c_2$ is the concentration of metal or non-metal
ions in equivalents per cubic centimetre and $a_{Me}$ and $a_X$ are the
corresponding activities. This equation, however, is fulfilled only if
the transport number of electrons $t_3$ is practically equal to unity.

Both equations (16) and (18) are useful for the calculation of the
rate constant of oxidation reactions. To use equation (16) the electrical 
conductivity and the transport numbers of ions and electrons as 
functions of the partial pressure of the non-metal are required. Use of 
equation (18) is recommended if the determination of self-diffusion 
coefficients is easier than the determination of transport numbers. 

As already pointed out by HAUFFE, on the basis of WAGNER's
theory of oxidation, the chemical viewpoint, i.e. the distinction between
oxidation, halogenation and sulphuration reactions, is not of para- 
mount importance. We must, however, clearly distinguish between 
oxidation processes in which the intermediate layer is mainly an 
electronic conductor and those in which it is mainly an ionic conductor. 
In the former case we must also distinguish between conductors in 
which the current is carried by quasi-free electrons and conductors in 
which the current is carried by electron holes. This distinction is of 
decisive importance for tarnishing reactions. 

According to the general equation (16) the electronic conductivity 
is the rate determining factor if the intermediate layer is an ionic 
conductor. An example is the formation of AgBr on a silver surface 
exposed to bromine vapour. Here the rate of bromination will decrease 
with decreasing electronic conductivity and decreasing concentration 
of electron holes. In contrast to these systems with an ionic conducting 
layer the rate of oxidation can be increased or decreased in systems with 
an electronic conductor as intermediate layer if the concentration of the 
lattice defects is altered, for in this case the ionic conductivity is the 
rate determining factor. 


Oxidation Processes Involving Ionic Conducting Layers 


Consider the reaction between silver and gaseous bromine. According
to the general scheme silver ions and electrons migrate from the
interface between Ag and AgBr across the bromide to the outer surface
and here they react with adsorbed bromine atoms. Whereas the
concentration of silver vacancies and silver ions in interstitial
positions is independent of the vapour pressure of the bromine, the
concentration of electron holes is a function of the partial vapour
pressure of the bromine. According to WAGNER we have

$$\tfrac{1}{2}\,\mathrm{Br_2^{(g)}} = \mathrm{AgBr} + \mathrm{Ag}\square' + \oplus \tag{19}$$

Applying the law of mass action we obtain for the hole conductivity

$$\kappa_{\oplus} = \kappa_{\oplus\,(p_{\mathrm{Br_2}}=1)}\;p_{\mathrm{Br_2}}^{1/2} \tag{20}$$

and finally by combining with equation (16) for the rate of bromination

$$\frac{\dot n_{\mathrm{AgBr}}}{A} = \left\{\frac{300}{96500}\,\kappa_{\oplus\,(p_{\mathrm{Br_2}}=1)}\,\frac{RT}{N e}\,p_{\mathrm{Br_2}}^{1/2}\right\}\frac{1}{\Delta x} \tag{21}$$

if $p_{\mathrm{Br_2}} = p''_{\mathrm{Br_2}} \gg p'_{\mathrm{Br_2}}$. In agreement
with this, WAGNER was able to show that between 200 and 400°C the rate
of bromination is proportional to the square root of the bromine vapour
pressure. Furthermore the transport number $t_3$ of electron holes at
200°C and $p_{\mathrm{Br_2}}$ = 0.23 atm is expected from bromination
experiments to be about 17 per cent, whereas from conductivity
measurements it was found to be 12 per cent. This result is in
satisfactory agreement with WAGNER's assumptions. The same mechanism
may be assumed to apply to the reaction of silver with chlorine.
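
The square-root law for the bromination rate is obtained from the general equation (16) by a single integration. The intermediate step, which the text leaves to the reader, may be sketched as follows, assuming that the ionic transport number is practically unity, that the valency of the bromide ion is one, and that the bromine vapour behaves as an ideal diatomic gas.

```latex
% With t_1 + t_2 \approx 1, |z_2| = 1 and t_3\kappa = \kappa_\oplus :
d\mu_{\mathrm{Br}} = \tfrac{1}{2}\,RT\,d\ln p_{\mathrm{Br_2}}

\int_{\mu'}^{\mu''} \kappa_{\oplus}\,d\mu_{\mathrm{Br}}
  = \tfrac{1}{2}\,RT\,\kappa_{\oplus\,(p=1)}
    \int_{p'}^{p''} p^{1/2}\,\frac{dp}{p}
  = RT\,\kappa_{\oplus\,(p=1)}\left(\sqrt{p''} - \sqrt{p'}\right)

% For p'' \gg p' only the outer pressure survives:
k_r \approx \frac{300}{96500}\,\frac{RT}{N e}\,
            \kappa_{\oplus\,(p=1)}\,\sqrt{p''_{\mathrm{Br_2}}}
```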


Oxidation Processes Involving Excess Semi-conducting Layers 
Next, we consider the reaction between zinc and oxygen. Experiments
showed that the electric conductivity of zinc oxide is mainly due to
quasi-free electrons $\ominus$ and, to a lesser extent, to interstitial
zinc ions. Consequently, in contrast to the halogenation of silver, the
rate determining factor in the oxidation of zinc is the migration of zinc
ions via interstitial positions, i.e. the ionic conductivity. Applying
the law of mass action to equation (7)

$$c_{\mathrm{Zn}\circ^{\cdot}}\;c_{\ominus} = K\,p_{\mathrm{O_2}}^{-1/2} \tag{22}$$

and putting $c_{\mathrm{Zn}\circ^{\cdot}} = c_{\ominus}$, we obtain for the
conductivity due to zinc ions

$$\kappa_{\mathrm{Zn}} = \kappa_{\mathrm{Zn}\,(p_{\mathrm{O_2}}=1)}\;p_{\mathrm{O_2}}^{-1/4} \tag{23}$$

Proceeding in the same way as in the derivation of equation (21) we
finally obtain for the rate of oxidation

$$\frac{\dot n_{\mathrm{ZnO}}}{A} = \left\{\frac{300}{96500}\,\kappa_{\mathrm{Zn}\,(p_{\mathrm{O_2}}=1)}\,\frac{RT}{N e}\left(\frac{1}{\sqrt[4]{p'_{\mathrm{O_2}}}} - \frac{1}{\sqrt[4]{p''_{\mathrm{O_2}}}}\right)\right\}\frac{1}{\Delta x} \tag{24}$$


It follows from equation (24) that the rate of oxidation of zinc is
independent of the partial pressure of oxygen, since the term belonging
to the very small oxygen pressure at the metal/oxide interface
outweighs that of the outer pressure. This conclusion was
experimentally confirmed by WAGNER and GRUENEWALD and can easily be
understood. Fig. 6 shows the distribution of interstitial zinc ions and
of quasi-free electrons in the lattice. Near the oxide/metal interface
the oxide is rich in interstitial zinc ions and quasi-free electrons,
whereas their concentration at the outer surface is very small. It is,
therefore, apparent that the concentration gradient of
$\mathrm{Zn}\circ^{\cdot}$ in zinc oxide, and thus the rate of oxidation of
zinc, should be almost independent of the oxygen pressure.
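
How weakly the outer oxygen pressure enters can be seen from a short numerical sketch. It assumes that the pressure-dependent part of the rate has the form of a difference of inverse fourth roots of the inner and outer oxygen pressures, as follows from a zinc ion conductivity proportional to the power −1/4 of the pressure. The function name and the value chosen for the inner pressure are hypothetical and serve only as an illustration.

```python
def zinc_pressure_factor(p_inner, p_outer):
    """Pressure-dependent factor of the zinc oxidation rate.

    p_inner: oxygen pressure (atm) in equilibrium with zinc at the
             metal/oxide interface; p_outer: pressure at the outer surface.
    """
    return p_inner ** -0.25 - p_outer ** -0.25


# Hypothetical inner pressure, chosen only to be very much smaller than 1 atm.
p_metal_side = 1.0e-40

factor_at_1_atm = zinc_pressure_factor(p_metal_side, 1.0)
factor_at_0_01_atm = zinc_pressure_factor(p_metal_side, 0.01)

# Ratio of the two rates when the outer pressure is lowered a hundredfold.
ratio = factor_at_0_01_atm / factor_at_1_atm
```

With an inner pressure so far below the outer one, the ratio differs from unity only by a few parts in ten thousand million, so that a hundredfold change of the outer pressure leaves the rate practically unaltered.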

The rate constant of zinc oxidation, as found by WaGNeER and 
GRUENEWALD, is k, = 7°5 . 10-" g? cm~* h~ at 400°C. In contrast to 


Fig. 6. Concentration gradi-
this, the rate constant of oxidation of spectroscopically pure zinc under
exactly similar experimental conditions was found by GENscH and 
HAUFFE”™ to be &, 9. 10-" at 390°C. It must be assumed that the 
high value of k is a sign of great purity of the zinc used. The oxidation
of zinc alloys, mentioned below, will show that this conclusion is
justified.

It is to be expected that the oxidation of cadmium proceeds according
to the same mechanism, for FAIVRE and MICHEL found that an
increasing excess of metal leads to an increase of the lattice parameter.
This as well as the dependence of the electric conductivity on the
partial pressure of oxygen makes it probable that there are metal
ions in interstitial positions, and quasi-free electrons. Conductivity
measurements carried out by EARLE, and HAUFFE and associates
on TiO₂ show that this, too, is an excess semi-conductor; however, it
contains no interstitial cations, but oxygen ion vacancies as shown by
EHRLICH. GULBRANSEN and associates have investigated the
dependence of the rate of oxidation of titanium on oxygen pressure.









Oxidation Processes Involving Deficit Semi-conducting Layers 
Good examples of this type of process are the reactions of iron and
copper with oxygen, the wüstite-phase and cuprous oxide having been
investigated in particular by JETTE and FOOTE, DARKEN and GURRY,
and by WAGNER and associates. Unfortunately the rate of oxidation
of iron to FeO at 950 and 1000°C is determined by surface reactions.
We shall, therefore, rather consider the oxidation of copper investigated
by WAGNER and GRUENEWALD.

Since Cu₂O is an electronic conductor, the velocity of diffusion of
copper ions and electrons is determined by the more slowly moving
copper ions, moving via vacancies. According to equation (16) the
copper ion conductivity $\kappa_{\mathrm{Cu}} = \kappa \cdot t_1$ is the rate
determining factor.


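Using the experimental result

$$\kappa_{\mathrm{Cu}} = \kappa_{\mathrm{Cu}\,(p_{\mathrm{O_2}}=1)}\;p_{\mathrm{O_2}}^{1/7} \tag{25}$$

and the relation

$$d\mu_X = \tfrac{1}{2}\,d\mu_{\mathrm{O_2}} = \tfrac{1}{2}\,RT\,d\ln p_{\mathrm{O_2}} \tag{26}$$

we obtain for the rate of oxidation of copper

$$\frac{\dot n_{\mathrm{Cu_2O}}}{A} = \left\{\frac{300}{96500}\,\kappa_{\mathrm{Cu}\,(p_{\mathrm{O_2}}=1)}\,\frac{7}{4}\,\frac{RT}{N e}\left(\sqrt[7]{p''_{\mathrm{O_2}}} - \sqrt[7]{p'_{\mathrm{O_2}}}\right)\right\}\frac{1}{\Delta x} \tag{27a}$$

[Equation (27b), which expresses the same rate through the mobility
$B_{\mathrm{Cu}\square'}$ of the cation vacancies, is not legible in the
source.]
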
The mobility of the cation vacancies can be calculated from the electric
conductivity, the transport number of the cations, and the copper
deficit in cuprous oxide. These data have been used by WAGNER and
GRUENEWALD to calculate the values of the rational rate constant $k_r$
as indicated in Table 1. Observed and calculated values of $k_r$ are in
good agreement. Furthermore, the experimental observations led to
$k_r \propto p_{\mathrm{O_2}}^{1/7}$, in fairly good agreement with the
theoretically expected relation $k_r \propto p_{\mathrm{O_2}}^{1/8}$
(see Fig. 7).


TABLE 1
Calculated and Measured Rate Constants of the Formation of
Cuprous Oxide on Copper at 1000°C as measured by WAGNER
and GRUENEWALD

Rational Rate Constant $k_r$ (equivalents cm⁻¹ s⁻¹)
Calculated        Observed


Fig. 7. Dependence of the rate of oxidation of copper on the partial
pressure of oxygen (in atm) at 1000°C according to WAGNER and GRUENEWALD


SHOCKLEY and associates, and CASTELLAN and MOORE, have
investigated the self-diffusion of copper in cuprous oxide which was
growing on copper but unfortunately the cuprous oxide was not
homogeneous. Therefore their self-diffusion coefficients cannot be used
in WAGNER's third formula (27c) for the rate of oxidation of copper
(cf. equation (18)), namely

$$k_r = (1 + z_{\mathrm{Cu}})\;c\;D^*_{\mathrm{Cu}}\left\{1 - \left(c'_{\mathrm{Cu}\square'}\,/\,c''_{\mathrm{Cu}\square'}\right)\right\} \tag{27c}$$

where $D^*_{\mathrm{Cu}}$ is the self-diffusion coefficient of copper ions in
cuprous oxide in equilibrium with oxygen.
In the same manner WAGNER and GRUENEWALD have investigated
the oxidation of nickel and found k, « Po,’ *. whereas theoretically 
k, & Po,"* should be obtained (see equations (48) and (49)). The dis- 
crepancy between experimental results and theoretical expectation is 
not yet understood. Recently HAUFFE and RAHMEL have investigated
the reaction between iron and sulphur at 670°C. According to the
equation

$$\tfrac{1}{2}\,\mathrm{S_2^{(g)}} = \mathrm{FeS} + \mathrm{Fe}\square'' + 2\,\oplus \tag{28}$$


the rate of sulphuration of iron is a linear function of the 1/4 and | 
power of the sulphur vapour pressure (see Fig. 8). The theoretical 
relation is $k \propto p_{\mathrm{S_2}}^{1/6}$. The agreement is fairly good.





Fig. 8. Dependence of the rate of sulphuration of iron (carbonyl iron)
on the partial pressure of sulphur (in mm Hg) at 670°C, according to
HAUFFE and RAHMEL (k in g² cm⁻⁴ s⁻¹)


APPLICATION OF WAGNER’S THEORY OF OXIDATION TO THE 
OXIDATION OF ALLOYS 


In the above chapters we considered the oxidation of pure metals and 
developed quantitative relations between transport of both ions and 
electrons and the rate of oxidation of metals. In this chapter we shall 
consider the influence of the additions of small amounts of a foreign 
metal on the rate of oxidation. Accepting WAGNER’s theory of oxida- 
tion and assuming that the mobilities of the disordered ions and electrons 
will not be markedly changed by the presence of foreign ions, we must 
diminish the concentration of ionic or electronic defects in order to 
obtain a decrease in the rate of oxidation. Conversely the velocity of 
oxidation will increase on increasing the concentration of ionic defects, 
electron holes or quasi-free electrons. To understand this we must 
consider the application of the law of mass action to ionic and electronic 
lattice defects in mixed crystals. 


Oxidation Processes Involving Ionic Conducting Layers


WAGNER, HAUFFE and associates have shown that the
concentration of vacancies and ions in interstitial positions, or of
electron holes and quasi-free electrons, and likewise the electrical
conductivity can be greatly varied by the addition of ions of higher or
lower valencies. This will be shown for the case of silver bromide. For
a FRENKEL type of defect we have the following equilibrium

$$\mathrm{AgBr} \rightleftharpoons \mathrm{Ag}\circ^{\cdot} + \mathrm{Ag}\square' \tag{29}$$

and with the law of mass action we obtain

$$c_{\mathrm{Ag}\circ^{\cdot}}\;c_{\mathrm{Ag}\square'} = K \tag{30}$$

This equation is valid for pure silver bromide and for silver bromide
containing small quantities of another metal bromide. In the case of
pure silver bromide we have the special condition

$$c^{\circ}_{\mathrm{Ag}\square'} = c^{\circ}_{\mathrm{Ag}\circ^{\cdot}} \tag{31}$$

If cadmium bromide is dissolved in silver bromide, divalent cadmium
ions replace silver ions and one cation vacancy $\mathrm{Ag}\square'$
appears for each divalent cadmium ion replacing a silver ion in the
lattice, $\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})$ (see Fig. 9). The equation
for the solution of cadmium bromide in silver bromide then reads

$$\mathrm{CdBr_2} = \mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag}) + \mathrm{Ag}\square' + 2\,\mathrm{AgBr} \tag{32}$$

Fig. 9. Lattice defects in a mixed crystal of AgBr-CdBr₂ as proposed
by WAGNER

Consequently, because of electro-neutrality, in a binary solution the
number of cation vacancies is equal to the number of cadmium ions
plus the number of silver ions in interstitial positions

$$c_{\mathrm{Ag}\square'} = c_{\mathrm{Ag}\circ^{\cdot}} + c_{\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})} \tag{32a}$$

Using (30), (31) and (32a) we have

$$c_{\mathrm{Ag}\square'} = \frac{a}{2} + \sqrt{\frac{a^{2}}{4} + \left(c^{\circ}_{\mathrm{Ag}\square'}\right)^{2}} \tag{33}$$

where $a = c_{\mathrm{Cd}\bullet^{\cdot}(\mathrm{Ag})}$.


It follows from (33) that on increasing the concentration of cadmium
bromide in silver bromide the concentration of cation vacancies and,
therefore, the electric conductivity will increase. Experiments carried
out by KOCH and WAGNER and TELTOW confirm the above conclusion.
This result is of importance for the interpretation of the rate
of bromination of silver alloys as determined by HAUFFE and
GENSCH.

For the rate of bromination of silver and silver alloys the electron
hole conductivity is the rate determining factor and we must, therefore,
find a relation between the hole conductivity and the cadmium
concentration in silver and the CdBr₂ concentration in AgBr, respectively.
Applying the law of mass action to (19) at constant bromine vapour
pressure

$$c_{\mathrm{Ag}\square'}\;c_{\oplus} = K \quad (\text{for } p_{\mathrm{Br_2}} = \text{const}) \tag{34}$$

Observing the relation

$$\frac{k}{k^{\circ}} = \frac{\kappa_{\oplus}}{\kappa^{\circ}_{\oplus}} = \frac{c_{\oplus}}{c^{\circ}_{\oplus}} \tag{35}$$

and using equation (33) we finally obtain for the ratio of the oxidation
rate of silver-cadmium alloys $k$ to that of pure silver $k^{\circ}$

$$\frac{k}{k^{\circ}} = \frac{c^{\circ}_{\mathrm{Ag}\square'}}{\dfrac{a}{2} + \sqrt{\dfrac{a^{2}}{4} + \left(c^{\circ}_{\mathrm{Ag}\square'}\right)^{2}}} \tag{36}$$

Using TELTOW's value $c^{\circ}_{\mathrm{Ag}\square'} = 2.9 \cdot 10^{-3}$ at 330°C
the calculated values of $k$ agree satisfactorily with the values of $k$
obtained by the experiments shown in Fig. 10. Silver-lead and silver-zinc
alloys behave similarly under identical experimental conditions (see
Fig. 11).

Fig. 10. The dependence of the ratio of the rate constant of bromination
of the silver-cadmium alloy, $k$, to that of the pure silver, $k^{\circ}$,
on the amount of cadmium added $N_{\mathrm{Cd}}$ (atom fraction) as measured
by HAUFFE and GENSCH (T = 330°C and $p_{\mathrm{Br_2}}$ = 170 mm Hg). Curve 1
observed and Curve 2 calculated
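
The effect of a divalent addition on the bromination rate can be followed numerically. The Python sketch below assumes ideal mass action for the FRENKEL equilibrium, electro-neutrality between cation vacancies, interstitial silver ions and dissolved cadmium ions, and a rate proportional to the hole concentration at constant bromine pressure. The function names are illustrative, the value 2.9 · 10⁻³ is the defect concentration quoted in the text for 330°C, and the cadmium concentrations in the usage lines are hypothetical.

```python
import math


def vacancy_concentration(cadmium, c_pure):
    """Cation vacancy concentration in AgBr containing dissolved CdBr2.

    Solves  c_int * c_vac = c_pure**2  together with the neutrality
    condition  c_vac = c_int + cadmium  (all in mole fractions).
    """
    return cadmium / 2.0 + math.sqrt(cadmium**2 / 4.0 + c_pure**2)


def bromination_rate_ratio(cadmium, c_pure):
    """Ratio k/k0 of the alloy to pure silver at constant bromine pressure.

    The hole concentration is inversely proportional to the vacancy
    concentration, so that k/k0 = c_pure / c_vac.
    """
    return c_pure / vacancy_concentration(cadmium, c_pure)


C_PURE_330 = 2.9e-3   # defect concentration in pure AgBr at 330 C (from the text)

# Hypothetical cadmium contents (mole fraction in the bromide layer).
ratios = [bromination_rate_ratio(a, C_PURE_330) for a in (0.0, 0.005, 0.02)]
```

The ratio starts at unity for the pure metal and falls steadily as the cadmium content rises, approaching the limiting value c°/a once the addition greatly exceeds the native defect concentration.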


Fig. 11. Dependence of the rate constant of bromination of the silver
alloy, $k$ (equ. cm⁻¹ s⁻¹), on the increasing amount of cadmium, lead
or zinc added as measured by GENSCH and HAUFFE (T = 330°C and
$p_{\mathrm{Br_2}}$ = 170 mm Hg)

According to the order of increasing solubility, ZnBr₂ < PbBr₂
< CdBr₂, we have

$$\left(\frac{k}{k^{\circ}}\right)_{\mathrm{Ag/Cd}} < \left(\frac{k}{k^{\circ}}\right)_{\mathrm{Ag/Pb}} < \left(\frac{k}{k^{\circ}}\right)_{\mathrm{Ag/Zn}}$$


Similar results were obtained by HIMMLER, who found that the rate
of chlorination of silver is decreased by the presence of lead chloride
and cadmium chloride. The chance of developing silver alloys more
resistant to attack by halogens at high temperatures than silver-cadmium
alloys does not seem to be very great at present, if it is remembered
that only metals which are soluble in silver, whose ionic radius is
1.1 ± 0.2 Å and whose valency is 2 or 3 can be used. Only if the radius
falls within the above limits can a solubility in AgBr large enough to
produce a high concentration of silver ion vacancies be expected.
Employing GOLDSCHMIDT's values for the ionic radii only the following
cations need be considered: Cd²⁺ (1.03), Pb²⁺ (1.32), Cu²⁺ (0.96),
Tl³⁺ (1.05), Sr²⁺ (1.27) and La³⁺ (1.22). The use of silver alloys
containing more than one alloying metal is not desirable, because the
more soluble metal ion diminishes the solubility of the other metal
ions in silver halides. The resistance of ternary silver alloys is
thereby often reduced.

Fig. 12. Variation of the electrical conductivity of zinc oxide with
increasing amounts (mole per cent of foreign oxide) of trivalent and
monovalent cations at 390°C and 1 atm as measured by HAUFFE and VIERK


Oxidation Processes Involving Quasi-free Electron Conducting Layers


We shall now consider the influence of small amounts of aluminium and
lithium in zinc on the rate of oxidation of zinc. If we oxidize a
zinc-aluminium alloy at 390°C in an atmosphere of oxygen, a layer of
zinc oxide containing Al₂O₃ is formed. Measurements of the electric
conductivity of zinc oxide carried out by HAUFFE and VIERK (see Fig. 12)
showed that with increasing amounts of Al₂O₃ the concentration of
quasi-free electrons increases while that of zinc ions in interstitial
positions decreases. The equation corresponding to the solution of
Al₂O₃ in zinc oxide is

$$\mathrm{Al_2O_3} = 2\,\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn}) + 2\,\ominus + 2\,\mathrm{ZnO} + \tfrac{1}{2}\,\mathrm{O_2} \tag{37a}$$

$$\tfrac{1}{2}\,\mathrm{O_2} + 2\,\mathrm{Zn}\circ^{\cdot} + \mathrm{Al_2O_3} = 2\,\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn}) + 4\,\mathrm{ZnO} \tag{37b}$$


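Fig. 13 shows aluminium ions in zinc ion positions. Because of
electro-neutrality quasi-free electrons $\ominus$ will appear while
interstitial zinc ions will disappear; for the same reason we have

$$c_{\ominus} = c_{\mathrm{Zn}\circ^{\cdot}} + c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})} \tag{38}$$

Thus, using equation (38) and substituting in equation (22) at constant
oxygen pressure, we obtain for the ratio of the rate of oxidation
constants of the Zn/Al alloy $k$ to that of the pure zinc $k^{\circ}$

$$\frac{k}{k^{\circ}} = \frac{c_{\mathrm{Zn}\circ^{\cdot}}}{c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}}
 = \frac{1}{c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}}\left\{-\frac{c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}}{2} + \sqrt{\frac{c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}^{2}}{4} + \left(c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}\right)^{2}}\right\} \tag{39}$$

where $c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}$ is the concentration of
aluminium ions at zinc ion lattice points.

Fig. 13. Lattice defects in a mixed crystal of ZnO-Al₂O₃

Fig. 14. Lattice defects in a mixed crystal of ZnO-Li₂O

Measurements of the electric conductivity of mixtures of zinc oxide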

and aluminium oxide at 400°C showed that the concentration of inter- 
stitial zinc ions in pure ZnO can be estimated as c,,... ~ 4. 10-°.” It is 
easy to show, therefore, that if the addition amounts to more than 
0-1 atom per cent, equation (39) becomes 


$$\frac{k}{k^{\circ}} \cong \frac{c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}}{c_{\mathrm{Al}\bullet^{\cdot}(\mathrm{Zn})}} \tag{40}$$

We can treat the oxidation of a zinc-lithium alloy in the same way.
For zinc oxide containing lithium oxide we have

$$\tfrac{1}{2}\,\mathrm{O_2} + 2\,\ominus + \mathrm{Li_2O} = 2\,\mathrm{Li}\bullet'(\mathrm{Zn}) + 2\,\mathrm{ZnO} \tag{41}$$

Assuming the validity of equation (41) we see that adding lithium oxide
to zinc oxide causes an increase in the concentration of interstitial zinc
ions and a decrease in the concentration of quasi-free electrons (see
Fig. 14).

Furthermore, the condition of electro-neutrality requires

$$c_{\mathrm{Zn}\circ^{\cdot}} = c_{\ominus} + c_{\mathrm{Li}\bullet'(\mathrm{Zn})} \tag{42}$$


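Thus, combining equation (42) and the equilibrium condition at constant
oxygen pressure, equation (22), we finally obtain for the ratio of
the oxidation rate constant of the Zn/Li alloy $k$ to that of pure zinc
$k^{\circ}$

$$\frac{k}{k^{\circ}} = \frac{1}{c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}}\left\{\frac{c_{\mathrm{Li}\bullet'(\mathrm{Zn})}}{2} + \sqrt{\frac{c_{\mathrm{Li}\bullet'(\mathrm{Zn})}^{2}}{4} + \left(c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}\right)^{2}}\right\} \tag{43}$$

and if $c^{\circ}_{\mathrm{Zn}\circ^{\cdot}} \ll c_{\mathrm{Li}\bullet'(\mathrm{Zn})}$

$$\frac{k}{k^{\circ}} \cong \frac{c_{\mathrm{Li}\bullet'(\mathrm{Zn})}}{c^{\circ}_{\mathrm{Zn}\circ^{\cdot}}} \tag{44}$$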

The experimental results obtained by GENSCH and HAUFFE confirm
the validity of the assumption made (see Fig. 15). Table 2 shows the
observed and the calculated values for the oxidation rate constants of
spectroscopically pure zinc and for zinc alloys. The curious dependence
of the rate of oxidation of the zinc-thallium alloys on the thallium
content of the alloy is remarkable. As shown by Fig. 15 the rate of
oxidation of an alloy containing 0.1 atom per cent Tl is greater than
that of pure zinc, whereas a greater thallium content (0.5 and 1.0
atom per cent) leads to a rate of oxidation which is lower than that of
pure zinc. This behaviour can only be understood if it is tentatively
assumed that small additions of thallium enter the zinc oxide lattice as
univalent cations whereas greater quantities enter as trivalent cations.

TABLE 2
Calculated and Measured Rate Constants of the Formation
of Oxide on Zinc and Zinc Alloys at 390°C, as measured by
GENSCH and HAUFFE ($p_{\mathrm{O_2}}$ = 1 atm)

Zinc alloys in        Rate Constant of Oxidation k (g² cm⁻⁴ h⁻¹)
atom per cent         Observed        Calculated

Zn pure
0.1 Al
1.0 Al
0.4 Li
0.1 Tl
0.5 Tl
1.0 Tl

Fig. 15. Influence of foreign metals on the rate of oxidation of zinc
with time, as measured by GENSCH and HAUFFE ($p_{\mathrm{O_2}}$ = 1 atm and
T = 390°C)





Fig. 16. Dependence of the electrical conductivity of TiO₂ at 600°C on
the increasing amounts of WO₃ ($N_{\mathrm{WO_3}}$ mole fraction) added
($p_{\mathrm{O_2}}$ = 1 atm). According to HAUFFE, GRUNEWALD and GREESE


Since the common impurities of zinc are aluminium, iron and gal- 
lium, but not alkali metals, oxidation produces zinc oxide layers con- 
taining trivalent cations and giving a lower rate of oxidation than 
observed for pure zinc. It is, therefore, permissible to assume that the 
high value of the rate of oxidation of zinc mentioned on page 79 is a 
direct indication of the purity of the zinc used. 

From the theoretical viewpoint, and according to conductivity
measurements on mixed crystals of TiO₂ and WO₃ recently carried out
by HAUFFE, GRUNEWALD and GREESE (cf. Fig. 16), the rate of oxidation
of titanium may be expected to diminish in the presence of tungsten,
which decreases the concentration of oxygen ion vacancies. An
investigation of the oxidation of titanium alloys from this special
point of view has not yet been made. The dissolution of tungsten oxide in
titanium oxide can be formulated as follows:

$$\mathrm{WO_3} = \mathrm{W}\bullet^{\cdot\cdot}(\mathrm{Ti}) + 2\,\ominus + \mathrm{TiO_2} + \tfrac{1}{2}\,\mathrm{O_2} \tag{45}$$
00° + WO, = W@’ (Ti) + + TiO, + 40,. . . (45a) 
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According to the tentatively proposed equilibrium, 


TiO, = 00" + 40,, ; a eee 


the rate of oxidation of titanium and titanium alloys, in contrast to the
oxidation of zinc and its alloys, seems to be determined by the diffusion
of oxygen ions. HAUFFE and PFEIFFER show that the rate of
oxidation of titanium decreases in the presence of WO₃ vapour in
oxygen (Fig. 17).

Fig. 17. Decrease of the rate of oxidation of titanium due to the
presence of WO₃ (curve II) in oxygen of 1 atm at 800°C as
measured by HAUFFE and PFEIFFER


Oxidation Processes Involving Electron-hole-conducting Layers 


The rate of oxidation of a metal covered by an electron-hole-conducting
oxide layer can be decreased or increased by addition of a foreign
metal whose oxide dissolves in the mother oxide and which gives
monovalent or trivalent cations respectively. The cause of this
behaviour can be understood by applying the theory of lattice defects
to mixed oxide crystals. Fig. 18 shows schematically what happens
when monovalent ions, e.g. Li⁺, enter the crystal lattice of nickel oxide
which is of the hole conducting type. According to the equation

$$\tfrac{1}{2}\,\mathrm{O_2^{(g)}} + \mathrm{Li_2O} = 2\,\mathrm{Li}\bullet'(\mathrm{Ni}) + 2\,\oplus + 2\,\mathrm{NiO} \tag{47}$$

where $\mathrm{Li}\bullet'(\mathrm{Ni})$ represents a lithium ion in a nickel
position and the ′ the negative charge of this "impurity centre," we
obtain an increase of the electron hole concentration with an increasing
concentration of lithium oxide, and, therefore, an increase in the
electrical conductivity. The conductivity measurements of VERWEY
and associates have confirmed the presence of this type of defect
(see Fig. 19).

Fig. 18. Lattice defects in a mixed crystal of NiO-Li₂O

Fig. 19. Dependence of the electrical conductivity of NiO on the
increasing amount of Li₂O added, as measured by VERWEY, HAAYMAN
and ROMEYN at 20°C








In analogy to equation (4) the equilibrium condition for lattice
defects in nickel oxide in an atmosphere of oxygen reads

$$\tfrac{1}{2}\,\mathrm{O_2} = \mathrm{NiO} + \mathrm{Ni}\square'' + 2\,\oplus \tag{48}$$

Applying the ideal law of mass action, we get

$$c_{\mathrm{Ni}\square''}\;c_{\oplus}^{2} = K \quad (\text{if } p_{\mathrm{O_2}} = \text{const}) \tag{49}$$

where $c_{\mathrm{Ni}\square''}$ represents the concentration of nickel ion
vacancies. Furthermore, electroneutrality leads to

$$c_{\oplus} = 2\,c_{\mathrm{Ni}\square''} + c_{\mathrm{Li}\bullet'(\mathrm{Ni})} \tag{50}$$

If $c_{\mathrm{Li}\bullet'(\mathrm{Ni})} \gg c^{\circ}_{\mathrm{Ni}\square''}$ and if
we substitute silver ions for lithium ions, then the ratio of the
oxidation rate constant of a nickel-silver alloy $k$ to that of pure
nickel $k^{\circ}$, calculated from equations (49) and (50), equals

$$\frac{k}{k^{\circ}} \cong \left(\frac{2\,c^{\circ}_{\mathrm{Ni}\square''}}{c_{\mathrm{Ag}\bullet'(\mathrm{Ni})}}\right)^{2} \tag{51}$$

The decrease in the rate of oxidation due to the presence of silver in
nickel is far less than expected from equation (51). The cause seems to
be the small solubility of silver oxide in nickel oxide, as observed even
at 1000°C by HAUFFE and PFEIFFER.
Making use of the remarkably great solubility of lithium oxide in
nickel oxide and the fact that at 1100°C the vapour pressure of lithium
oxide is satisfactorily high, HAUFFE and PFEIFFER investigated the
attack of oxygen on nickel at 1000° and 1100°C in the presence of
Li₂O vapour. According to equations (47), (50) and (51a)

$$\frac{k}{k^{\circ}} \cong \left(\frac{2\,c^{\circ}_{\mathrm{Ni}\square''}}{c_{\mathrm{Li}\bullet'(\mathrm{Ni})}}\right)^{2} \tag{51a}$$

we obtain a decrease in the rate of oxidation of nickel (see Fig. 19).
At present a quantitative evaluation is not yet possible.

Fig. 20. Dependence of the electrical conductivity of NiO on the
increasing amount of Cr₂O₃ (mole per cent) added, according to HAUFFE
and BLOCK


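On the other hand, if chromium oxide is dissolved in nickel oxide the
electric conductivity is lowered considerably (see Fig. 20). According
to WAGNER's scheme shown in Fig. 21, chromium ions lead to an increase
in the concentration of nickel ion vacancies and to a decrease in the
concentration of electron holes. The equation for the solution of
Cr₂O₃ in nickel oxide then reads

$$2\,\oplus + \mathrm{Cr_2O_3} = 2\,\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni}) + 2\,\mathrm{NiO} + \tfrac{1}{2}\,\mathrm{O_2} \tag{52a}$$

or

$$\mathrm{Cr_2O_3} = 2\,\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni}) + \mathrm{Ni}\square'' + 3\,\mathrm{NiO} \tag{52b}$$

Here the symbol $\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})$ represents a chromium
ion in a nickel ion site.

Fig. 21. Lattice defects in a mixed crystal of NiO-Cr₂O₃

Remembering that the condition of electro-neutrality holds in the
NiO-Cr₂O₃ phase we obtain

$$2\,c_{\mathrm{Ni}\square''} = c_{\oplus} + c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \tag{53}$$

and, approximating,

$$c_{\mathrm{Ni}\square''} \cong \tfrac{1}{2}\,c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})}
\quad (\text{if } c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})} \gg c^{\circ}_{\mathrm{Ni}\square''}) \tag{54}$$

The ratio of the oxidation rate constant of a nickel-chromium alloy $k$ to
that of pure nickel $k^{\circ}$ is

$$\frac{k}{k^{\circ}} = \frac{c_{\mathrm{Ni}\square''}}{c^{\circ}_{\mathrm{Ni}\square''}} \cong \frac{c_{\mathrm{Cr}\bullet^{\cdot}(\mathrm{Ni})}}{2\,c^{\circ}_{\mathrm{Ni}\square''}} \tag{55}$$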

Although there had been many empirical investigations on the oxidation of nickel alloys, the first experiments with the theory of lattice defects in view were carried out by WAGNER and ZIMENS. In agreement with the requirements of equation (55), Table 3 shows that the rate of oxidation of nickel increases with increasing amounts of chromium. The relatively rapid decrease in the rate of oxidation at higher concentrations of chromium (> 6 atom per cent Cr) is probably due to the formation of a compact layer of the spinel phase NiCr₂O₄, in which ionic diffusion processes occur extremely slowly.

TABLE 3
The rate of oxidation of nickel-chromium alloys at 1000°C as measured by WAGNER and ZIMENS

As was pointed out by WAGNER and ZIMENS, the quantitative discussion of the experimental results is rather difficult, because we must assume that nickel and chromium ions do not possess the same mobilities, so that we are here dealing with a solid solution whose composition is a function of the distance from the oxide/metal interface. If the mobility of the chromium ion is small, the region near the oxide/metal interface will be rich in chromium oxide. Conversely, if the mobility of the chromium ion is the larger of the two, the concentration of chromium oxide near the outer surface will be above the average concentration. At temperatures above 1000°C no accumulation of Cr₂O₃ near the outer surface is in fact observed, but this may be because Cr₂O₃ is rather volatile. Concerning this problem an electron diffraction and electron microscope study carried out by GULBRANSEN, PHELPS and HICKMAN is of importance. The quantitative treatment of the different mobilities of the divalent nickel ions and trivalent chromium ions opens up new possibilities of understanding many of the rather complicated oxidation processes of alloys.

The oxide layer formed on nickel-chromium alloys with 10-20 per cent Cr is composed either of a spinel (> 1000°C) or of Cr₂O₃ (< 1000°C). In addition the greater mobility of chromium ions in spinel and NiO-Cr₂O₃ layers observed by HAUFFE and PSCHERA seems to be of importance for the formation of oxide layers. In the case of the oxidation of iron-aluminium alloys an increase of Al₂O₃ in the oxide layer was found by KORNILOW and SCHPIKELMANN. An extensive discussion of this matter was given by HAUFFE.


THE FORMATION OF MULTI-LAYERED SCALE CONTAINING
VARIOUS METAL OXIDE PHASES


One of the important factors determining the resistance of a metal or 
alloy to oxidation at high temperatures is the structure of the oxide 
layer formed at the surface. An extensive investigation of structure 
has been carried out by GULBRANSEN and associates.**-* Applying the 
electron diffraction reflection technique they could observe the structure 
of the oxide layer in situ at temperatures at which it is actually protec- 
tive. The results of these measurements are of particular importance 
for the class of oxidation processes in which, from a thermodynamic 
point of view, two or three oxide layers can be expected. 

One of the most thoroughly investigated processes is the oxidation of iron at different temperatures and pressures of oxygen. The first relatively complete studies of the oxidation of iron at elevated and low temperatures were carried out by JACKSON and QUARRELL and FISCHBECK and NELSON. They recognized that the formation of different oxides and their structural transformations are functions of time as well as temperature. They found that below 400°C the stable oxide on
iron was Fe,O,, and that FeO was formed between 515 and 900°C. 
Between 400 and 515°C, FeO, Fe,O, or a mixture of the two oxides 
were found. JIMORI could distinguish between the cubic Fe₂O₃ and Fe₃O₄ by means of x-ray analysis; by electron diffraction this would have been impossible. From x-ray evidence JIMORI postulated a transition of γ-Fe₂O₃ with cubic spinel structure to α-Fe₂O₃ with rhombohedral structure at 340°C. GULBRANSEN and HICKMAN found a lower transition point at about 225°C. For long periods of oxidation at up to 400°C they observed the formation of α-Fe₂O₃. Alongside this evidence we should note that for thick films formed at 500°C the inverse spinel Fe₃O₄ is the only oxide observed. Finally, we have studies by BÉNARD and associates of the influence of temperature on the kinetics of formation of different iron oxide layers (see Fig. 22). Thus among all these results there seem to be many disagreements; they are due to difficulties which may be listed as difficulties of

(a) reproducibility,
(b) accurate measurement of the surface temperature,
(c) accurate determination of the time of oxidation,
(d) controlling the film thickness, and
(e) controlling the pressure of oxygen.

Fig. 22. Influence of temperature on the kinetics of formation of different iron oxide layers, as measured by BÉNARD and COQUELLE. (N = mole fraction of FeO, Fe₃O₄ and Fe₂O₃, respectively)


Recently BIRCHENALL and associates have investigated the mechanism and the kinetics of the scaling of iron under the pressure of 1 atm of oxygen between 400 and 1200°C. The results indicate that in no case, above a temperature of about 625°C, does the layer contain more than 10 per cent by volume of the higher oxides Fe₃O₄ and Fe₂O₃


TABLE 4
Approximate percentages of FeO, Fe₃O₄ and Fe₂O₃ in scale formed on iron oxidized in oxygen; according to DAVIES, SIMNAD and BIRCHENALL

T (°C)   t (min)   FeO (pct)   Fe₃O₄ (pct)   Fe₂O₃ (pct)
600 100 7h a 

625 110 70 30 

700 225 8 > 4 

750 100 6 4 

800 100 16 2 2 
900 100 00 4 < | 
1000 25 95 4 es 
1090 25 “4 5 <i 
(see Table 4). To measure the rate of growth of Fe₃O₄ and Fe₂O₃, samples of FeO and Fe₃O₄ were prepared and oxidized in oxygen. The values for the parabolic rate constants are given in Table 5. Below 570°C the oxide layer formed is almost pure Fe₃O₄. The reactions involved, neglecting the formation of Fe₂O₃, are:

$$3\,\mathrm{Fe} + 2\,\mathrm{O_2} = \mathrm{Fe_3O_4} \qquad (56a)$$

$$3\,\mathrm{FeO} + \tfrac{1}{2}\mathrm{O_2} = \mathrm{Fe_3O_4} \qquad (56b)$$

TABLE 5
Values for the parabolic rate constants k ($\mathrm{g\,cm^{-2}\,min^{-1/2}}$) of the oxidation of iron in oxygen; from DAVIES, SIMNAD and BIRCHENALL

T (°C)                                            Remarks
400, 500, 600, 700, 800, 900, 1035, 1130, 1200    Oxidation of Fe to form multilayer scales
850, 910, 930, 950
800, 900, 988                                     Oxidation of Fe₃O₄



BIRCHENALL showed that, for a given thickness of Fe₃O₄, the increase in weight for process (56a) is four times as great as for process (56b). BIRCHENALL and associates tentatively suggest that the growth of each layer is a diffusion-controlled process and that the interface reaction rates may be neglected. Applying a modified Pfeil technique using radioactive silver Ag¹¹⁰ as inert markers, they obtain a very interesting diagram of the oxidation process of iron (see Fig. 23). Whereas for the formation of FeO the rate-determining step is the migration of Fe²⁺ ions, in hematite it is not the iron ions that wander but the oxygen ions. In magnetite, however, both ions migrate. These results are consistent with the semi-conducting properties of the oxides of iron.
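
The factor of four between processes (56a) and (56b) follows from stoichiometry alone: forming one mole of Fe₃O₄ from iron takes up four gram-atoms of oxygen from the gas, whereas forming it from FeO takes up only one. The following minimal sketch checks this arithmetic; the atomic weight and all names in it are assumptions introduced for illustration and do not come from the original measurements.

```python
# Oxygen taken up per mole of Fe3O4 formed, from the stoichiometry of
# reactions (56a) and (56b). The atomic weight and the names used here are
# assumptions introduced for this illustration.
ATOMIC_WEIGHT_O = 15.999  # g per g-atom of oxygen

# (56a) 3 Fe  + 2 O2   -> Fe3O4 : all four O atoms come from the gas
# (56b) 3 FeO + 1/2 O2 -> Fe3O4 : only one O atom comes from the gas
O_ATOMS_FROM_GAS = {"56a": 4.0, "56b": 1.0}


def weight_gain_per_mole_fe3o4(process: str) -> float:
    """Weight increase (g) of the specimen per mole of Fe3O4 formed."""
    return O_ATOMS_FROM_GAS[process] * ATOMIC_WEIGHT_O


gain_a = weight_gain_per_mole_fe3o4("56a")
gain_b = weight_gain_per_mole_fe3o4("56b")
ratio = gain_a / gain_b
print(f"gain (56a) = {gain_a:.3f} g, gain (56b) = {gain_b:.3f} g, ratio = {ratio:.1f}")
```

Because a given thickness of Fe₃O₄ corresponds to a fixed number of moles per unit area, the same ratio applies to the measured weight increases.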

Fig. 23. Schematic diagram of the diffusion of ions across the different oxide layers during the oxidation of iron on the basis of the experimental results of DAVIES, SIMNAD and BIRCHENALL

On oxidizing copper at 1000°C at an oxygen pressure of about 100 mm Hg, an outer and an inner layer of CuO and Cu₂O, respectively, are formed. Just as the molar volume of Cu₂O is greater than that of metallic copper, so the molar volume of CuO is greater than that of Cu₂O (1 g-atom of Cu = 7.1 cm³; ½ mole of Cu₂O = 11.7 cm³; 1 mole of CuO = 12.3 cm³). The cupric oxide layer on the cuprous oxide layer is, therefore, compact. Under these conditions PILLING and BEDWORTH, and also FEITKNECHT and FROEHLICH, found that the rate of oxidation of copper is independent of the oxygen pressure. This experimental observation is easily understood. The excess of oxygen in the Cu₂O phase at the Cu₂O/CuO interface is determined by the two-phase equilibrium Cu₂O + CuO, as is also, therefore, the concentration gradient of copper ion vacancies in the cuprous oxide layer. On the basis of the rate of oxidation of copper to Cu₂O, and of Cu₂O to CuO, WAGNER suggested that the ratio by weight of the two oxides formed (at 1000°C and an oxygen pressure of 1 atm) is given by:

$$\frac{\Delta x_{\mathrm{CuO}}}{\Delta x_{\mathrm{Cu_2O}}} = \frac{k_{\mathrm{CuO}}}{k_{\mathrm{Cu_2O}}} = 0.4 \times 10^{-3}$$

i.e. under the above experimental conditions practically pure cuprous oxide is formed, only the outer surface, representing 0.1 per cent of the total thickness of the film, consisting of cupric oxide. It can be assumed that the sulphuration of iron at 670°C and a sulphur vapour pressure of 1 atm gives rise to an iron sulphide layer of analogous composition. Here the rate of sulphuration of iron to FeS is a thousand times faster than that of FeS to FeS₂, as was recently shown by HAUFFE and RAHMEL.
(keg = 33.10-% and ky. = 1-4.10-" (equ. cm s~)). A great 
number of metals and alloys have been investigated from the same point 
of view, in particular by GULBRANSEN and associates.**-** 
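
Returning to the molar volumes quoted for copper and its two oxides (7.1, 11.7 and 12.3 cm³, each referred to one gram-atom of copper), the argument that both layers are compact can be checked by forming the two volume ratios. The sketch below does only that; treating a ratio greater than one as the condition for a compact layer is the usual Pilling-Bedworth criterion and is stated here as an assumption, and the function and variable names are hypothetical.

```python
# Volume ratios per gram-atom of copper, using the molar volumes quoted in
# the text. Taking a ratio above 1 as the sign of a compact layer is the
# usual Pilling-Bedworth criterion, used here as an assumption.
V_CU = 7.1          # cm^3 per g-atom of Cu
V_HALF_CU2O = 11.7  # cm^3 per 1/2 mole of Cu2O (contains one g-atom of Cu)
V_CUO = 12.3        # cm^3 per mole of CuO (contains one g-atom of Cu)


def volume_ratio(v_product: float, v_substrate: float) -> float:
    """Volume of product formed per unit volume of substrate consumed."""
    return v_product / v_substrate


ratio_cu2o_on_cu = volume_ratio(V_HALF_CU2O, V_CU)
ratio_cuo_on_cu2o = volume_ratio(V_CUO, V_HALF_CU2O)

for name, r in [("Cu2O on Cu", ratio_cu2o_on_cu), ("CuO on Cu2O", ratio_cuo_on_cu2o)]:
    print(f"{name}: ratio = {r:.2f}, compact = {r > 1.0}")
```

Both ratios exceed unity, the second only slightly, which is consistent with the statement that the cupric oxide layer on cuprous oxide is compact.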




THE FORMATION OF THIN COMPACT OXIDE FILMS ON METALS

Besides WAGNER's theory, MOTT and CABRERA have recently put forward another theory of the oxidation of metals. WAGNER's formula is valid only if the oxide layer is so thick that the concentrations of interstitial ions (or ion vacancies) and free electrons (or electron holes) are equal to one another throughout most of its thickness. MOTT and CABRERA derive a formula especially for the rate of oxidation in thin and very thin oxide films with a thickness between 10 and 1000 Å. Applying the fundamental assumptions of the SCHOTTKY-MOTT boundary layer theory (Randschicht-Theorie) of crystal rectifiers, MOTT was the first to show that in the vicinity of the metal/oxide interface there is a diffuse electrical double layer because the electrons move more quickly to the adsorbed oxygen atoms than the ions. This process plays an important role if the oxide film is thinner than 100 Å. In such a thin film the electrical field strength is so high that the rate of migration of ions is no longer proportional to the field strength (i.e. it is no longer inversely proportional to the thickness of the oxide film) but depends quasi-exponentially upon the thickness.

MOTT and CABRERA gave theoretical conditions leading to the logarithmic law, the parabolic law (x² = 2kt), the cubic law (x³ = 3kt, experimentally confirmed by CAMPBELL and THOMAS for the oxidation of copper), and lastly the linear law (x = kt, experimentally confirmed by GULBRANSEN and WYSONG for the oxidation of aluminium). WAGNER recently pointed out that the possibilities are not yet exhausted. Whereas MOTT's theory of oxidation processes assumes one double layer at the oxide/metal interface, WAGNER expects during the growth of an oxide film an overlapping of two double layers, the one at the metal/oxide interface and the other at the oxide/oxygen interface. Thus another interpretation of the logarithmic law seems possible, especially for films with a thickness greater than 100 Å.
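
The three explicit rate laws named above (parabolic, cubic and linear) differ only in the power of the film thickness that grows linearly with time. As a rough sketch, they can be written as functions of time as follows; the function names are hypothetical, the units are left arbitrary, and the constant k has different dimensions in each law. The logarithmic law is omitted because the text gives no explicit formula for it.

```python
# Film thickness x as a function of time t for the rate laws quoted in the
# text. Function names are hypothetical; units are arbitrary, and k has
# different dimensions in each law.


def thickness_parabolic(k: float, t: float) -> float:
    """Parabolic law, x^2 = 2 k t."""
    return (2.0 * k * t) ** 0.5


def thickness_cubic(k: float, t: float) -> float:
    """Cubic law, x^3 = 3 k t."""
    return (3.0 * k * t) ** (1.0 / 3.0)


def thickness_linear(k: float, t: float) -> float:
    """Linear law, x = k t."""
    return k * t


if __name__ == "__main__":
    for t in (1.0, 4.0, 16.0):
        print(
            f"t = {t:5.1f}: parabolic {thickness_parabolic(1.0, t):.3f}, "
            f"cubic {thickness_cubic(1.0, t):.3f}, "
            f"linear {thickness_linear(1.0, t):.3f}"
        )
```

Differentiating the parabolic form gives dx/dt = k/x, the rate inversely proportional to thickness mentioned in connection with thick films.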

MOORE and LEE's experimental results for the oxidation of zinc, which cannot be explained either by the theory of MOTT and CABRERA or by WAGNER's theory, led ENGELL and HAUFFE to give an explanation which might be of general interest and will, therefore, be described in some detail. For layers of up to 1000 Å MOORE and LEE found parabolic growth with a rate of growth dependent on the pressure of oxygen, whereas WAGNER's theory predicts a rate of growth independent of the oxygen pressure. WAGNER and GRUENEWALD had, indeed, found for thick layers (> 1000 Å) a parabolic growth with the rate of growth independent of oxygen pressure. Using the SCHOTTKY-SPENKE boundary-layer theory of crystal rectifiers, ENGELL and HAUFFE advance an explanation of the above observations.
During the formation of the oxide layer two regions must be distinguished. As shown by Fig. 24 there is a small region A in which an electric field has been set up because of a depletion of quasi-free electrons which, owing to their relatively great mobility, have been captured by the chemisorbed oxygen atoms on the surface. In this region the migration of the interstitial zinc ions is mainly due to this electric field, while in the region B the motion of the interstitial zinc ions is mainly caused by the concentration gradient. The thickness l of region A they call the critical thickness.

Fig. 24. Scheme of the distribution of quasi-free electrons and interstitial zinc ions in the boundary layer (region A) and in the interior of the semi-conductor (region B). (l is the critical thickness of the boundary layer)



For both regions of the oxide layer the current of quasi-free electrons $I_{\ominus}$ and of interstitial zinc ions $I_{\bullet}$ is given by the usual equations

$$I_{\ominus} = -D_{\ominus}\frac{\partial n_{\ominus}}{\partial x} - E(x)\,n_{\ominus}\,B_{\ominus} \qquad (57a)$$

$$I_{\bullet} = -D_{\bullet}\frac{\partial n_{\bullet}}{\partial x} + v_{\bullet}\,E(x)\,n_{\bullet}\,B_{\bullet} \qquad (57b)$$

Here $B$ is the mobility in cm² V⁻¹ s⁻¹, $n_{\bullet}$ and $n_{\ominus}$ the concentration of interstitial metal ions and quasi-free electrons respectively, $v_{\bullet}$ the charge of the metal ions in interstitial sites and $E(x)$ the electric field strength in V cm⁻¹ at the point $x$. Equations (57a), (57b) and the Boltzmann equation

$$n_{R} = n_{H}\exp\left(-\frac{eV_{D}}{kT}\right) \qquad (58)$$

exactly determine the distribution of positive and negative charges in the boundary layer. Here $V_{D}$ is the diffusion potential, $n_{R}$ and $n_{H}$ the concentration of quasi-free electrons in the boundary layer and in the interior of the oxide layer.
The system is stationary if the concentration of interstitial cations in the boundary layer is such that the divergence of the current of cations moving across the whole layer (A and B) is zero. Since the electric field may be neglected in the interior of the oxide ($E = 0$ for $x > l$) we obtain, neglecting the transport of cations in the boundary layer by diffusion,

$$\frac{\mathrm{d}}{\mathrm{d}x}\bigl(v_{\bullet}\,n_{\bullet}(x)\,B_{\bullet}\,E(x)\bigr)_{x<l} = 0 \qquad (59)$$

With $D_{\bullet} = B_{\bullet}kT/e$ and Poisson's equation

$$\frac{\mathrm{d}^{2}V}{\mathrm{d}x^{2}} = -\frac{4\pi e}{\varepsilon}\,v_{\bullet}\,n_{\bullet}(x),$$

$\varepsilon$ being the permittivity, we obtain on integrating between the limits $x = 0$ and $x = l$ for the concentration of interstitial cations in the boundary layer

$$n_{\bullet}(x) = \left[\frac{\varepsilon\,kT}{8\pi e^{2}v_{\bullet}^{2}}\left|\frac{\partial n_{\bullet}}{\partial x}\right|_{x>l}\right]^{1/2}(l-x)^{-1/2} \qquad (60)$$

By means of a second integration from 0 to / we finally obtain for the 


thickness of the boundary layer 


l= (61) 
e 


1 = const 
ox /z 1) 
For a critical discussion of these equations see ENGELL, HAUFFE and SCHOTTKY.

Using the equation for the chemisorption of oxygen on zinc oxide

$$\tfrac{1}{2}\mathrm{O_2} + 2\ominus = \mathrm{O}^{2-(\sigma)} \qquad (62)$$

HAUFFE and ENGELL finally obtain for $n^{(\sigma)}_{\mathrm{O}^{2-}}$
lje 


00 «ee 
4 |e 


ni (63) 


- Ny . 


Therefore $n^{(\sigma)}$, the number of chemisorbed oxygen atoms, increases logarithmically with increasing partial pressure of oxygen. All these formulae are only valid if there is an "exhaustion layer" (the layer A of Fig. 24). They seem to be approximately true in zinc oxide up to 300°C.

If the entire thickness of the oxide layer is less than l the rate of oxidation is dependent on the partial pressure of oxygen even though the diffusion of metal ions to the surface is rate-determining. The reasoning is as follows.

At the beginning of the oxidation and for a constant partial pressure of oxygen, which leads to a constant diffusion potential $V_{D}$, the electric field strength $E(x_1)$ will decrease with increasing thickness $x_1$ and we have $E(x_1) > E(l)$, if $x_1 < l$. Now it may be assumed that the majority of the electrons arrive at the surface of the oxide, initially, by means of the tunnel effect and later, for a thicker boundary layer, by means of
thermionic emission (cf. ScpotrKy and Mortt-CaBrera). If their con- 
centration is n we can put n > n,,, and n is practically independent on 
the thickness of the layer z,, if z, < /. Electrons reaching the surface 
must do so against the electric field. 

Let N be the maximum number of surface sites per square centimetre available to chemisorbed oxygen atoms, and K be a constant. We obtain:
re 


65) 
kT aie 


K . po,\* . n® exp |— 2£(z,) 


For small surface concentrations (n\ N) we have 
' 

- ° ‘ . r,e a 

. 2n\\? = K . Po,’ at n* ° exp | - 2E(z,) 7) . (66) 


where 


We finally obtain 


, kT . , 
E(zx,) In(po, . n* . K"*) — InE(z,). 
4xr,e ' 


In this equation /nE(x,) < E(z,), since E(z,)-is about 10° V/cm. If 
there is a strong electric field the current due to diffusion may be 
neglected compared with the current due to the electric field and we 
obtain from (576) 
$$I_{\bullet} = n_{\bullet}\,B_{\bullet}\,E(x_1)\,v_{\bullet} \qquad (68)$$

Combining (57b), (67) and (68) we finally have for the current of cations (i.e. the number of the cations entering the surface per square centimetre per second),
kT . 
Ip = Ng. » Par. in(p,. .s*. HK)... . (@) 
4xr,e ; 
Comparison of (57b) and (66) shows that the rate of formation of thin films is proportional to the surface concentration of chemisorbed oxygen ($I_{\bullet} \propto E(x) \propto n^{(\sigma)}$).

If the transport of cations to the surface determines the rate of oxidation of the metal, equation (69) gives the following rate law

$$\frac{\mathrm{d}x}{\mathrm{d}t} = \frac{k}{x} \qquad (70)$$


a Al 


(Inpo, + In(n*K*)) 
e ’ 
and $V_{ox}$ and N are the molar volume of the oxide and Avogadro's number, respectively. Equation (70) shows that the rate constant of oxidation k is a logarithmic function of the partial pressure of oxygen

$$k = \mathrm{const}\cdot\ln p_{\mathrm{O_2}} + \mathrm{const}' \qquad (71)$$

The validity of equations (70) and (71) was experimentally confirmed for the oxidation of zinc by MOORE and LEE. As shown in Figs. 25 and 26 the rate of oxidation follows the parabolic rate law and k is a logarithmic function of the partial pressure of oxygen, as is to be expected according to equation (71).

Fig. 25. Oxidation of zinc at 390°C, in parabolic plot, according to MOORE and LEE

Fig. 26. Logarithmic dependence of the parabolic rate constant k of the oxidation of zinc on the partial pressure of oxygen, according to ENGELL and HAUFFE, on the basis of measurements carried out by MOORE and LEE
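
A relation of the form of equation (71) is tested in practice by plotting k against the logarithm of the oxygen pressure and checking for a straight line, which is what Fig. 26 shows. The following sketch fits the two constants by ordinary least squares. It is only an illustration: the function name is hypothetical, and the data are synthetic values generated from an assumed pair of constants, not the measurements of MOORE and LEE.

```python
import math

# Least-squares fit of k = slope * ln(p) + intercept, the form of equation
# (71). The function name is hypothetical and the data below are synthetic,
# generated from assumed constants; they are NOT measured values.


def fit_log_pressure(pressures, rate_constants):
    """Return (slope, intercept) of the best straight line k versus ln(p)."""
    xs = [math.log(p) for p in pressures]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(rate_constants) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, rate_constants))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Synthetic example: pressures in arbitrary units, k built from assumed
# constants const = 2 and const' = 5.
pressures = [1.0, 10.0, 100.0, 760.0]
rate_constants = [2.0 * math.log(p) + 5.0 for p in pressures]

slope, intercept = fit_log_pressure(pressures, rate_constants)
print(f"const = {slope:.3f}, const' = {intercept:.3f}")
```

With real data the scatter about the fitted line indicates how well the logarithmic dependence holds; a rate constant independent of pressure, as found for thick layers, would appear as a slope indistinguishable from zero.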

WAGNER and GRUENEWALD also investigated the influence of the partial pressure of oxygen on the rate of oxidation of zinc and found k to be independent of $p_{\mathrm{O_2}}$. This result is in good agreement with WAGNER's theory of oxidation as well as with the views advanced in this article, for in WAGNER's experiments the condition x > l was fulfilled.
This article does not aim at giving a complete review of all important experimental facts concerning the oxidation of metals and alloys. For example, there has been no reference to the interesting experiments on the internal oxidation of copper and silver alloys carried out by RHINES and associates and MEIJERING and RATHENAU. The main purpose has been to stimulate the metal expert by giving him some of the new theoretical points of view on this matter. I would like to express my gratitude to Dr. R. LANDSBERG for numerous stimulating discussions, and for the linguistic help he has given me.

GASES IN METALS


C. R. Cupp


THE important relationships between metals and the elements normally thought of as gases have been under consideration for many years. In earlier times studies were confined mainly to solubility, diffusion, and embrittlement problems and casting defects caused by gases; now more subtle effects on the physical and mechanical properties of the metals are being considered. There are several books and reviews dealing with the theme of this paper. Other reviews of a more specialized nature will be referred to throughout. The present paper is an attempt to summarize the more important experimental facts and to mention the significance of these as interpreted by various observers. In keeping with the purpose of this series, particular reference will be made to more recent observations, but the results of earlier work must, at the same time, be given due recognition.

As in an earlier review,’ it should be pointed out that only those 
elements will be considered which exist as gases in their uncombined 
states, at normal temperature and pressure, namely, hydrogen, oxygen, 
nitrogen, the inert gases and the halogen gases. As a supplement to the 
list of references, a number of papers dealing with such combined 
gases as sulphur dioxide, carbon monoxide, carbon dioxide, hydrogen 
sulphide, etc., will be given, but no consideration of these will be made 
here. It is felt that purely adsorption phenomena, and corrosion effects 
due to gases (including surface oxidation) are not part of the subject at 
hand. Commercial degassing practice will not be reviewed, although 
some mention will be made of its necessity in various operations. 
Experimental methods will not be described in any detail; the reader is 
referred to the original literature for this information. This paper will 
deal with the solution and diffusion of gases in solid metals; with the 
absorption, retention, and evolution of gases by metals; and with 
effects upon the properties of the solid metals. The consideration of the 
marked effect of gases on many of the physical properties of metals will 
be the main theme of this work. 


GAS-METAL RELATIONSHIPS 


FAST has argued that the permeability of metals to gases is related to the ability of a given gas to combine with a given metal, diffusion rates being high where compound formation is possible. RHINES has amended this statement to say that diffusion is appreciable in those gas-metal couples in which the gas is soluble in the metal to a significant degree. SMITH has made a more pointed generalization, in the case of hydrogen at least, by emphasizing that increased permeability of the metal, however produced, is always accompanied by an increase in the "occlusive capacity" of the metal for hydrogen. At any rate if one considers that the halogen gases, although they react actively with practically all metals, have been found not to diffuse to any marked degree through any metals, or to dissolve in appreciable amounts in solid metals, then it must be assumed that FAST's statement is not correct.


Inert Gases


Turning to the role of inert gases, there can be no compound formation 
and the possibility of solubility is questionable. It has been reported 
recently* that the amount of argon absorbed by uranium at room 
temperature is less than | part in 3 x 10° by weight and at 600°C, 
less than | part in 3 x 10*, which is the magnitude of the uncertainties 
of measurements of the gas volume. Elsewhere it is indicated that inert gases are not dissolved to any measurable extent in solid metals. Similarly the evidence of diffusion of inert gases through metals is not well confirmed. The earliest report found in the literature showed that there is no permeability of metals to inert gases. From preliminary investigations of the removal of inert gases in discharge tubes at low pressures it was erroneously concluded that the gas penetrates into the cathode as ions, while the cathode metal is disintegrated and redeposited by diffusion. Later work of the same nature showed quite conclusively that the apparent loss of neon from lamp tubes was due not to the passage of the gas through iron either at room temperature or at a bright red heat, but to an apparent adsorption upon the iron under the influence of an electric discharge. As a very brief summary of the work by LOMPE and SEELIGER one would say that they found that, in the discharge tube, the inert gas particles were "shot into" ("hineingeschossen") the cathode and there "remained stuck" ("stecken bleiben") in some sort of mechanical trap. A curve showing the distribution of the particles indicated a greater concentration at the surface. On heating to a few hundred degrees Celsius the gas particles are released and "diffuse" ("diffundieren") out of the cathode. No claim was made that there was any true equilibrium solubility of the inert gas in the iron cathode, and it was not suggested that diffusion proceeded to cause a uniform distribution of the gas throughout the cathode.

One other experiment, of doubtful value,’’ indicated that at 505°C 
and a gas pressure of 72 cm Hg the diffusion of helium through alumi- 
nium was as rapid as that of hydrogen. The measured rate was 
0-026 x 10°? ml N.T.P. x mm thickness per sq cm heated area per 
second. Since this anomaly was not investigated further it must remain subject to doubt.


Nitrogen 

In his recent review RHINES points out that the compound rule proposed by FAST seems to hold in a general way for oxygen, nitrogen and hydrogen, but the correlation between solubility and diffusion is better. He states further:

"Nitrogen forms compounds with, is soluble in, and diffuses through, most of the metals of the last four groups of the periodic table of the elements. It is generally nonreactive with, insoluble in, and fails to diffuse significantly through, the first-group metals and only sparingly, when at all, through the metals of the second and third groups."

At an earlier time SMITHELLS made the following generalization:

“Nitrogen is only soluble in those metals which are capable of forming 
nitrides. We have previously noted the same distinction both in 
activated adsorption and diffusion, and the close connexion between 
the three kinds of gas-metal equilibria is again emphasized. Nitrogen 
has been shown to be insoluble within the limits of the experimental 
method in cobalt,’** copper,’** silver,?°* and gold.* Those metals 
which, like iron and molybdenum, absorb nitrogen, show the same kind 
of changes in lattice structure, mechanical and electrical properties as 
are shown when the hydride-forming metals absorb hydrogen. The 
nitrogen-metal systems have not been investigated so completely as 
those of hydrogen, but the close parallel between them is evident from 
the data.” 

Later SMITHELLS stated that "nitrogen is soluble in a limited number of metals and although there is evidence of simple solubility in some cases, all metals which dissolve this gas are known to form definite nitrides under suitable conditions."

An examination of the literature following this brief statement will indicate that SMITHELLS' generalization is probably closer to the truth than RHINES' statement. It is recognized that some of the errors in RHINES' statement may be due to the type of periodic table being used. Nitrogen is generally soluble in groups VA, VIA and VIIA but (except for iron) not VIII, if those are to be taken as the "last four groups." It is also soluble in group IVA. RHINES is quite correct about nitrogen's relation with groups IA and IB. There is some ambiguity in his statement regarding the second and third group elements, where nitrogen reacts with several elements to form nitrides but has not been reported to dissolve in the same elements. SMITHELLS' statement seems to be more generally acceptable.

On the basis of the two statements above, it seems worthwhile to investigate the literature to determine which metals do form nitrides and to find whether these do dissolve nitrogen under other conditions. Table 1 shows the elements of interest and their positions in the periodic table. WELLS reports that most elements combine with nitrogen, and the nitrides include some of the most stable compounds known. He reports that the ionic nitrides include those of lithium, beryllium, magnesium, the metals of IIA and IIB, germanium and thorium, but not those of the IVA metals of the type TiN. A nitride of mercury, Hg₃N₂, also occurs but may be different in structure from the other Group II nitrides. The nitrides of beryllium, magnesium and calcium in Group IIA are described as being anti-isomorphous with oxides M₂O₃, i.e. the metal atoms occupy the positions of the oxygen atoms and the nitrogen those of the metal atoms in the M₂O₃ structure. Thus the structure becomes a compound of the type M₃N₂.

* This number refers to the list of references at the end of the present paper.

Covalent nitrides include the nitrides of Group III, which have great chemical stability and are described as covalent by WELLS. These compounds are formed of equal numbers of atoms of elements of Groups III and V, and the nitrides of aluminium, gallium and indium crystallize with the wurtzite structure.

WELLS reports numerous nitrides of the transition elements of varied structures. The characteristic feature of these nitrides is that the small non-metal atoms occupy some or all of the interstices of the metal lattice. The relative positions of the metal and nitrogen atoms are controlled by the number and nature of the interstices occupied by the nitrogen atoms, and they may be the same as in certain simple structures such as the sodium chloride and calcium fluoride structures. The following compounds have been reported to crystallize with the rock-salt structure: ScN, CeN, LaN, TiN, ZrN, VN, NbN, PrN.

We have now determined which metals will form nitrides at higher 
temperatures; these same metals may dissolve nitrogen at lower 
temperatures. Quantitative measurements are not important here but a 
review of the literature, indicated in Table 2, will show that there is a 
definite solubility of nitrogen in the metals listed. 


TABLE 2 





Metal References Metal References 


Iron 3, 10, 23, 24, Hafnium 
Zirconium , , 27, 28, 28 Thorium 
Tantalum 3.3 Vanadium 
Molybdenum , 27, 30, 31, 32, ‘ Niobium 
Tungsten , 3, 10, 27, 3% Chromium 
Manganese 

Titanium 
HÄGG and SMITHELLS and ROOKSBY have found that, although tungsten reacts very slowly to form the nitride W₂N at high temperatures, no solid solutions of nitrogen in tungsten are formed.

It has been shown that in the ranges investigated in the experimental work, there is no solubility of nitrogen in cadmium, thallium, zinc, lead, bismuth, tin, and antimony. More recently it was found that there is no solubility of nitrogen below 1200°C in nickel, cobalt, copper, and tungsten. In the references above, which indicated that tungsten would dissolve nitrogen, the measurements were made above 1200°C, the solubility being 0.304 cm³/100 g at 2400°C, so that it is rather small if it exists at all.

RÖNTGEN and BRAUN found that no measurable amount of nitrogen
is absorbed by aluminium between 700°C and 1000°C because a surface
film of aluminium nitride is formed which prevents diffusion of the gas
into the metal. At most there may be 1 c.c./100 g in the molten metal
at the melting point, but the solubility in the solid metal would be
much smaller than that. On the other hand, CHAUDRON and MOREAU
found relatively large amounts of nitrogen (in one case 7.4 c.c./100 g
in the solid metal) dissolved in aluminium which had been
electrolytically polished so that no adsorbed layer would affect
the results. Their method of degassing was to make the specimen the
cathode in a vacuum discharge tube and apply a potential to the
terminals of from 5 to 150 kV with a current of about 5 mA. Degassing
took place in the cold, the surface molecules (of aluminium nitride
presumably) being ionized and drawn into the electric field,
replacement being effected by diffusion in the metal. In hydrogen-metal
systems the experimental results presented by these workers all appear
to indicate a much higher solubility than one would expect. It is felt
that their findings in the aluminium-nitrogen system may possibly
result from the decomposition of the aluminium nitride compound and
do not, in fact, represent the true solid solubility of nitrogen in
aluminium.

As was stated earlier, it seems that SMITHELLS’ statement, that
nitrogen is soluble only in those metals which are capable of forming
nitrides, is the most applicable generality that can be made about the
nitrogen-metal systems.


Oxygen

The measurement of the solubility of oxygen in metals is quite difficult,
since, except in the case of the noble metals, a separate oxide phase
is formed when the limit of the solid solubility is exceeded. Thus the
true solubility of the oxygen may be small in comparison with the total
amount of oxygen present in the metal. In many molten metals the
large content of dissolved oxygen is sharply decreased on solidification,
when the gas may be evolved or may form an oxide of the metal.
Generally, the solubility of oxygen in metals is of the same order of
magnitude as that of hydrogen, but it is usually more difficult to
determine. Unless the metal being used in an investigation is very
pure, the usual method of establishing equilibrium by diffusion
inwards from an external oxide phase is liable to lead to erroneous
results. As will be shown later, the gas consumed in the preferential
internal oxidation of impurity elements, which form more stable oxides
than the metal being examined, may be misinterpreted as gas going into
solution. Thus a large error may be introduced where the true
solubility is, in fact, quite small.

The oxygen-copper system seems to have been investigated more
thoroughly than any other. HANSON, MARRYAT and FORD placed
the solubility limit at less than 0.009 per cent oxygen at 1000°C and
ALLEN and STREET suggested it to be not over 0.005 per cent oxygen
at 500°C. The earliest data on spectroscopically pure copper were given
by RHINES and MATHEWSON, who reported that the solid solubility
increases from 0.007 per cent by weight at 600°C to about 0.015 per cent
at 1050°C. A later paper by PHILLIPS and SKINNER showed that
previous determinations had given results which were too high. They
found the solubility to range from 0.0022 per cent at 700°C to 0.0077
per cent at 1050°C. The discrepancies probably arise from the
experimental difficulty of determining such small percentages of
dissolved gas analytically.

Usually the solubility of oxygen in metals is complicated by the
formation of oxide phases, so that complex phase diagrams result. A
diagram given by BARRER illustrates the great variety of solid solutions
possible. Earlier, HERTY and GAINES showed that oxygen must
dissolve in liquid iron as FeO, and that it probably dissolves in the
same manner in the solid metal. The maximum solubility in alpha-iron is
probably less than 0.05 per cent and may be slightly higher in
gamma-iron.

In the cobalt-oxygen system, SEYBOLT and MATHEWSON found from
analysis and micrographic observation that the solid solubility of
oxygen in cobalt increases from about 0.004 per cent by weight at
550°C to 0.020 per cent at 875°C, at which temperature the hexagonal
form of cobalt changes into the face-centred cubic form. In the gas-free
metal this transformation occurs at 850°C. In the cubic cobalt, the
oxygen content increases almost linearly at constant pressure from
0.004 per cent at 875°C to 0.022 per cent at 1450°C.

A similar solubility of oxygen in the form of nickel oxide (NiO) has
been reported by MERICA and WALTENBERG.

The solubility of oxygen in silver has been studied by STEACIE and
JOHNSON, who showed that it is proportional to the square root of
the pressure, is not in any simple relation to the temperature, and is a
minimum at 400°C. They reported that, at a pressure of 80 cm of
mercury, the solubility varies from 0.142 c.c. of oxygen per c.c. of
silver at 200°C to 0.087 c.c. at 400°C, 0.193 c.c. at 700°C, and 0.354 c.c.
at 800°C. They suggest that below 400°C the dissolved oxygen exists
in the form of Ag₂O, and above this temperature as oxygen atoms, since
Ag₂O is unstable at the higher temperature. SIMONS argues that the
minimum is due to the rate of increase of the dissociation pressure of
Ag₂O with temperature being greater than the rate of increase in
solubility of Ag₂O in silver.
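
As a rough illustration of the square-root pressure dependence, the
sketch below rescales the solubilities quoted above (reported at 80 cm
of mercury) to another pressure and picks out the temperature of the
minimum. The function and constant names are hypothetical, and the
sketch assumes that the square-root law holds over the whole pressure
range considered.

```python
import math

# Solubilities quoted in the text at 80 cm of mercury, in c.c. of oxygen
# per c.c. of silver, keyed by temperature in degrees Celsius.
REPORTED_AT_80_CM_HG = {200: 0.142, 400: 0.087, 700: 0.193, 800: 0.354}
REFERENCE_PRESSURE_CM_HG = 80.0


def solubility_at_pressure(temp_c, pressure_cm_hg):
    """Rescale a quoted solubility assuming S is proportional to sqrt(P)."""
    s_ref = REPORTED_AT_80_CM_HG[temp_c]
    return s_ref * math.sqrt(pressure_cm_hg / REFERENCE_PRESSURE_CM_HG)


def temperature_of_minimum():
    """Return the quoted temperature at which the solubility is smallest."""
    return min(REPORTED_AT_80_CM_HG, key=REPORTED_AT_80_CM_HG.get)


if __name__ == "__main__":
    # Quartering the pressure halves the solubility under this assumption.
    print(solubility_at_pressure(400, 20.0))
    print(temperature_of_minimum())
```

Only the four quoted temperatures can be looked up; no interpolation
between them is attempted, since the text states that the temperature
dependence follows no simple relation.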

SMITHELLS indicates that oxygen is probably slightly soluble in
palladium and rhodium, but there are no published data. Oxygen is
insoluble in mercury, gold, osmium, iridium, and platinum. If there is
any solid solubility of oxygen in tin and lead it would be very small.
GROSHEIM-KRISKO et al. report the oxygen content of molten lead
at 350°C to be 48 — 60 « 10~° weight per cent. Solid solutions con- 
taining up to 40 atomic per cent of oxygen without the separation of 
an oxide phase are possible in zirconium and titanium.
Oxygen is also soluble in vanadium, niobium and tantalum.


Hydrogen 
In the metal-hydrogen systems there are four different types of inter- 
action between the gas and the metal, as follows: 


1. Hydrogen will react to form hydrides of a salt-like nature with
the alkali and alkaline-earth metals. These primary salt-like
hydrides are ionic, and generally crystalline, non-volatile
compounds. These compounds, with the negative hydrogen or
hydride ion H⁻, are analogous to the corresponding halides.
GIBB and RUNDLE have fairly recent papers dealing in some
detail with the formation of these hydrides, and their references
will lead to a broader view than need be given here.

2. With the metals of Groups IVB, VB and VIB covalent hydrides
are formed, yielding compounds, gaseous at room temperature,
analogous to the hydrocarbons and typified by the familiar
arsine. Some of the covalent hydrides formed are AsH₃ and
SiH₄.

3. A number of metals, such as chromium, iron, cobalt, nickel,
platinum, copper, silver, molybdenum, magnesium, and
aluminium, dissolve hydrogen to form true solutions, as indicated
by the fact that (a) the hydrogen solubility varies as √P, and
(b) solubility increases with increasing temperature.

4. With the rare-earth metals and titanium, zirconium, thorium,
hafnium, vanadium, niobium, tantalum, uranium, and
manganese, hydrogen forms what DUSHMAN calls “pseudo-hydrides.”
SMITH describes these metals as being “exothermic
occluders,” while those in section (3) above are “endothermic
occluders.” Although the solubility varies as √P for certain
ranges of values of pressure (P), it decreases as the
temperature rises. Palladium behaves similarly with respect to
hydrogen.
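
The contrast between the “endothermic” and “exothermic” occluders of
types (3) and (4) can be sketched numerically. The example below
assumes the commonly used form S = S₀ √P exp(−ΔH/RT), which is not
written out in the text; the heats of solution and the prefactor are
hypothetical values chosen only to show the opposite temperature
trends.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mol K)


def sieverts_solubility(pressure, temp_k, heat_of_solution_cal, s0=1.0):
    """Solubility under the assumed form S = s0 * sqrt(P) * exp(-dH / (R T)).

    heat_of_solution_cal > 0 models an endothermic occluder,
    heat_of_solution_cal < 0 an exothermic occluder.
    """
    return s0 * math.sqrt(pressure) * math.exp(
        -heat_of_solution_cal / (R_CAL * temp_k)
    )


if __name__ == "__main__":
    # Hypothetical heats of solution, for illustration only.
    for label, dh in (("endothermic", 5000.0), ("exothermic", -5000.0)):
        low = sieverts_solubility(1.0, 600.0, dh)
        high = sieverts_solubility(1.0, 900.0, dh)
        trend = "rises" if high > low else "falls"
        print(f"{label}: solubility {trend} with temperature")
```

In both cases the assumed pressure dependence is the same, so
quadrupling the pressure doubles the computed solubility.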


As far as the writer has been able to determine, hydrogen is not
absorbed by tungsten (may have small solubility), gold, zinc (?),
cadmium (?), indium, thallium, or mercury. Uranium has been
reported to have a small solubility of hydrogen in one reference and to
be an “exothermic occluder” in another reference.

Solid tin and lead have a very small capacity to absorb hydrogen.


GAS-METAL EQUILIBRIUM, DIFFUSION AND SOLUTION


It has been stated in the introduction to this paper that the effects of
gases on the physical and mechanical properties of metals will be the
main subject of study. However, the chemical considerations of the
overall problem must not be entirely overlooked. In the preceding
section, the various possible gas-metal combinations were reviewed in
an attempt to show where one might expect to find unusual physical
effects. In comparatively recent years there has been an increasing
awareness of the profound effects which gases may have on solid metals.
Much more study is needed, however, before a reasonable theory can be
developed to cover the entire gas-metal field.

The volume of work which has been devoted to the study of gas-metal
equilibria is formidable. It is felt that it is better not to consider
this field at all than to create a false impression by too brief a
review; therefore, references will be made to a few fairly
recent reviews devoted largely to this phase of the work. The reader is
warned, not unjustly it is felt, to be extremely critical of solubility
measurements and diffusion rates, particularly for oxygen. As an
example of this, EVANS has listed the reported solubilities of oxygen
in iron as found by fourteen different groups. The results vary from
6 to 0.006 per cent oxygen (at 1000°C), a marked variation. EVANS
considers the serious discrepancy to be due to: (a) effects of traces of
impurities which are often not reported in the literature, (b) the use of
powder samples which adsorb large amounts of gas, which is reported
as dissolved gas, (c) contamination of the iron by refractories or volatile
compounds, (d) insufficient time allowed for equilibrium conditions to
be reached, and (e) variation in the methods used, and consequent
variation in the inherent errors. Similar types of errors are probably
the cause of variations in the reported solubilities in other systems.

Most papers on gases and metals refer to SMITHELLS’ monograph
(and occasionally to a paper), which is now, justifiably, a classic in this
field. In these treatments, SMITHELLS deals with the three processes
involved between gases and metals: adsorption, diffusion and solution,
and he shows how the laws governing these phenomena have been
evolved.

In the few years after the publication of SMITHELLS’ book, the
subject of gases and metals advanced rapidly, so that by 1941 there was
a need for a comprehensive review of the subject. BARRER fulfilled at
least part of the need by devoting a long section of his valuable
monograph to the subject of diffusion of gases through solid metals. In
this survey, BARRER dealt with the work which had been done up to that
time, and developed a reasonable explanation for the phenomenon of
gaseous diffusion through solid metals. The present author feels that
this monograph is still to be highly recommended, although it is in need
of revision.

Shortly before BARRER’s book was published, SIMONS and HAM
recorded their belief that the solution and diffusion of gases through
metals could be considered as purely chemical phenomena. It was
claimed that the specific nature of the solubility of gases in metals
indicates that the gases are present mainly as compounds with the
metal system. Since the rate of diffusion of a gas through a metal is
inversely proportional to the thickness of the metal, it was felt that the
adsorbed film of gas, usually postulated in diffusion processes, could not
be related to the empirical results. For this and other reasons, diffusion
was, therefore, treated as a case of chemical equilibrium between the
metal and the gas, and on this basis the usual diffusion laws were derived
without reference to the mechanism of the reaction at the gas-metal
interface.
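
The inverse dependence of diffusion rate on thickness mentioned above
is what steady-state diffusion through a plate gives. The sketch below
uses the familiar steady-state relation J = D (c₁ − c₂)/d as an
assumption for illustration; it is not the derivation of SIMONS and
HAM, and the function name, units and numerical values are
hypothetical.

```python
def steady_state_flux(diffusivity, conc_entry, conc_exit, thickness):
    """Steady-state flux through a plate: J = D * (c_entry - c_exit) / d."""
    if thickness <= 0:
        raise ValueError("thickness must be positive")
    return diffusivity * (conc_entry - conc_exit) / thickness


if __name__ == "__main__":
    # Hypothetical values in consistent arbitrary units.
    thin = steady_state_flux(1.0e-5, 2.0, 0.0, 0.1)
    thick = steady_state_flux(1.0e-5, 2.0, 0.0, 0.2)
    # Doubling the wall thickness halves the flux.
    print(thin / thick)
```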

STEARN and EYRING briefly considered the diffusion of gases in
metals from the point of view of a theory of the absolute rates of
reactions in which the slow process of diffusion entails the surmounting
of an energy barrier. In this paper some doubt was thrown on the
possibility of “rift” diffusion which had been suggested as a mechanism
by SMITH and his co-workers.

FAST has discussed the mechanism of diffusion of gases in metals, a
subject about which, he says, “. . . very little is yet known with
certainty.” In general the gas, in atomic (or ionic) form, penetrates
between the metal ions of the lattice. Unlike SIMONS and HAM,
FAST thinks that the surface effect and not the diffusion itself is
frequently the deciding factor in the diffusion velocity. In a later
paper FAST considered further the permeability of gases through metal
walls. As a result of the diatomic character of the gas phase, there are
five stages involved in the penetration of gases through metal walls:
(1) dissociation of molecules into atoms or ions at the surface of entry,
(2) penetration of the atoms or ions into the metal, (3) diffusion in the
metal, (4) a transition from the dissolved to the adsorbed state at the
exit surface, and (5) reassociation of the components into diatomic
molecules. The speed of the whole permeation process is determined by
the slowest of these five successive steps. Examples were given of
experimental cases where each of these five stages is the controlling
factor and these were discussed in some detail. The influence of the
surface condition on permeability was explained, and data were given
to show the effect of temperature on permeability.
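
The statement that the slowest of the five successive steps governs
the whole process can be expressed as a small sketch. The stage labels
paraphrase the list above; the rate values are hypothetical and carry
no experimental meaning.

```python
STAGES = (
    "dissociation at the entry surface",
    "penetration into the metal",
    "diffusion in the metal",
    "transition to the adsorbed state at the exit surface",
    "reassociation into diatomic molecules",
)


def controlling_stage(stage_rates):
    """Return (name, rate) of the slowest of the five successive stages."""
    if len(stage_rates) != len(STAGES):
        raise ValueError("one rate is required for each of the five stages")
    index = min(range(len(STAGES)), key=lambda i: stage_rates[i])
    return STAGES[index], stage_rates[index]


if __name__ == "__main__":
    # Hypothetical relative rates: here the entry-surface step is slowest.
    name, rate = controlling_stage([0.2, 5.0, 1.5, 4.0, 3.0])
    print(name, rate)
```

Treating the overall rate as equal to the smallest stage rate is the
simplest reading of the rule; it ignores any partial contribution from
the other steps.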

RHINES has written a general review of a number of industrial
processes which are influenced by gas-metal diffusion, which may have
either beneficial or detrimental effects. The information for that paper
was gathered by consulting people who were closely associated with the
various practices in which gas-metal diffusion plays a part. Some of the
phenomena discussed were: hydrogen embrittlement, subsurface
oxidation, and rejection of dissolved gases on solidification of molten
metal.

In 1948, BARRER reviewed the factors determining the formation
of interstitial compounds and some of their properties. While the
formation of compounds is not really within the scope of the present
paper, this discussion by BARRER is extremely interesting and valuable.
It is noteworthy that BARRER, unlike many authors who have written
about gas-metal systems, regards the interstitial hydrogen dissolved in
a metal to be substantially unionized. The paper deals with the
exothermal formation of solutions of hydrogen in metals, equilibrium
between interstitial and free gas, and rate-controlling processes.

In the same year, three papers appeared which apply to this
discussion. LEPP studied the absorption of gases in metal by applying
physico-chemical laws and thermodynamic analysis. SACHSE gave
detailed abstracts of studies (conducted in Germany during the late war)
of the diffusion of hydrogen and deuterium in palladium, and of
hydrogen in iron. SCHAFER reviewed the work of SIEVERTS done during the
war years on the solubility of hydrogen and deuterium in nickel,
niobium, and palladium, and on the effect of hydrogen on the transition
points in some iron alloys.

A monograph recently published by SMITH deals entirely with the
problem of hydrogen in metals. This monograph covers the subject
with remarkable completeness, and the bibliography of some 1470
references is very useful. However, very early in his book, SMITH
questions the possibility of the existence of solid solutions in
“endothermic metal-hydrogen systems,” and throughout the remainder of
the book the discussion is dominated by his “rift” hypothesis. It is
claimed that the various experimental phenomena can best be explained
by the opening or closing of these “rifts,” which are apparently lattice
defects, or widened lattice intervals. One is impressed by the fact that
the rift hypothesis is entirely qualitative. The concept of rifts might
also be questioned if the specificity of diffusion rates and solubility data
is considered. The present writer must agree with RANSLEY, who
said of SMITH’s monograph: “Indeed, it is not unfair to say that in a
good many instances the author has strained the facts very severely in
an attempt to make them conform to this theory, when a simple
explanation based on lattice solution and diffusion would appear to be
perfectly adequate.” There are, of course, cases where the rift
hypothesis may very well apply, and where it is, in fact, the preferred
explanation. However, the explanation of such a formidable mass of
empirical results by one mechanism does not yet seem possible. Since the
publication of this book, SMITH has written three papers, each
incorporating the rift hypothesis.

As part of a book on vacuum techniques, DUSHMAN included a
section (Chapter 9) on gases and metals. Although much of the
experimental data in this section was published before 1940, the review is
valuable because it does bring much of the available information under
one cover. DUSHMAN, among other things, considered the diffusion
and solubility of hydrogen, oxygen and nitrogen.

In 1950, BASTIEN discussed the various stages in the permeation of
a gas through a metal wall. The consideration, which deals particularly
with the iron-hydrogen system, includes adsorption and diffusion
phenomena, the effects of temperature and pressure, and effects of
gas-metal equilibria.

Recent work by BHAT and LLOYD has shown that the effect of
cold-work is to decrease the rate of hydrogen diffusion in steel at room
temperature. It was pointed out that diffusion results obtained by
electrolytic experiments are only qualitative; this might account for
many of the disagreements between solubility and diffusion
measurements in other hydrogen-metal systems. For reasons which will
not be discussed here, the authors believed that, although variation in
the lattice spacing could affect diffusion rates, the occluded
molecular hydrogen stored at crystallite boundaries is probably the major
factor affecting the diffusion process. The correlation of cold-work and
rate of hydrogen diffusion with changes in hardness and density is
also discussed. This work is related to the studies conducted by
ANDREW on hair-line cracks and embrittlement in steel (q.v.).

ISENBERG has written a brief note to demonstrate why hydrogen
ionizes when dissolved in a metal. On a semi-classical model the
conclusion was reached that hydrogen should ionize in all metals.

A series of studies by GULBRANSEN and ANDREW on the reactions
of zirconium, titanium, niobium and tantalum with oxygen, nitrogen
and hydrogen provides an excellent review of the work which has been
done on these metals. The data found by GULBRANSEN and ANDREW
on the kinetics of the gas-metal reactions add considerably to the
knowledge of these metals. At present it is difficult to follow the
latest work on titanium and zirconium, for so much experimental work
is in progress. The series noted might provide a useful starting point
for a reader particularly interested in this work.

A recent review by RHINES, although very short, summarizes some
of the experimental facts of gas-metal diffusion processes. This seems
to be the most recent review of this field.

Although not directly applicable here, it is interesting that
equilibrium relations in certain (titanium) alloy systems have been
investigated by measuring hydrogen pressures in equilibrium with very
small amounts of hydrogen in solution in the alloys. McQUILLAN has
studied the alpha-beta transformation in various titanium samples,
and the effect on the transformation of alloying elements.


EFFECTS OF DISSOLVED GASES ON THE PHYSICAL AND
MECHANICAL PROPERTIES OF METALS


In addition to such effects as hydrogen embrittlement of various metals,
and porosity caused by gases coming out of solution, there are several
phenomena of interest which are directly attributable to dissolved gases.


Internal Friction* 


One of the most interesting of these phenomena is the effect which
absorbed gases have on the relaxation behaviour of metals. A
comprehensive theoretical treatment of relaxation phenomena, to which
the reader is referred, has been given by ZENER. One type of
relaxation effect is internal friction. It has been found that, under
suitable conditions, there will be a maximum absorption of energy by a
metal when the period of an applied vibrational stress matches the
characteristic time of some mechanism taking place within the metal
itself. Such mechanisms as reversible grain boundary movement,
stress-induced diffusion, and thermal diffusion have been postulated as
causes for the marked dissipation of energy which has been observed in
some systems.
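
The maximum in energy absorption can be illustrated with the standard
single-relaxation (Debye) expression for internal friction,
Q⁻¹ = Δ ωτ / (1 + ω²τ²), which peaks where ωτ = 1. This formula is
assumed here from general relaxation theory and is not written out in
the text; the relaxation strength and times used below are
hypothetical.

```python
import math


def debye_internal_friction(relaxation_strength, omega, tau):
    """Internal friction of a single relaxation process.

    Assumed form: Q^-1 = strength * (omega * tau) / (1 + (omega * tau)**2),
    where omega is the angular frequency of the applied stress and tau is
    the relaxation time of the internal mechanism.
    """
    x = omega * tau
    return relaxation_strength * x / (1.0 + x * x)


if __name__ == "__main__":
    strength = 0.02                 # hypothetical relaxation strength
    omega = 2.0 * math.pi * 1.0     # stress applied at 1 cycle per second
    for product in (0.01, 0.1, 1.0, 10.0, 100.0):
        tau = product / omega
        q_inv = debye_internal_friction(strength, omega, tau)
        print(f"omega*tau = {product:6.2f}   Q^-1 = {q_inv:.5f}")
```

In this form the absorption is symmetric about ωτ = 1 on a logarithmic
scale and falls off on either side, which is why a peak appears when
either the frequency or the temperature (through τ) is varied.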

The relationship between magnetic and mechanical relaxation
phenomena was first mentioned by SNOEK. It had been pointed out by
YENSEN that small quantities of both carbon and nitrogen have a
marked influence on both the magnetic and plastic properties of iron.
It was thought that the slow decrease with time of the magnetic
permeability in iron at room temperature was due to the segregation of
nitrogen. Very little exact knowledge was available, however, until
SNOEK found that such phenomena as magnetic after-effect and time
decrease of permeability depend on the presence of carbon or nitrogen
in solid solution. If these solute atoms are precipitated from the
lattice the phenomenon of magnetic after-effect is no longer present.
RICHTER then found that samples of iron which had displayed a
magnetic after-effect also showed an elastic after-effect. Both effects
were temperature-dependent, and the absolute values of the relaxation
times were nearly equal. Finally, SNOEK was able to show that both
carbon and nitrogen are capable of producing an elastic after-effect
(relaxation peak) in iron, thus correlating the magnetic and mechanical
phenomena. SNOEK mentioned that there was a true nitrogen solubility
at room temperature and that this could be measured by means of the
elastic after-effect.

* This section on Internal Friction is presented only as a brief description of this
phenomenon which is one manifestation of the effect of gases on metals. A comprehensive
review of Internal Friction in Metals by A. S. Nowick is presented elsewhere in
this volume.

Fig. 1. Examples illustrating the octahedral and tetrahedral positions
in a body-centred cubic lattice.

By 1941 SNOEK was able to account for these effects on the basis of
the interstitial positions in a body-centred cubic lattice which have
tetragonal symmetry, the tetragonal axis being parallel to any one of
the three principal axes. In an unstrained material, the equilibrium
distribution of the solute atoms among the three types of interstitial
positions (given by the crystallographic indices (½00), (0½0), and
(00½)) is quite random. However, if the material is elastically
stressed in the x direction, there will be a preferred distribution of
solute atoms in the x-type positions (½00).

The above explanations apply to the case of annealed material. The 
results of SNOEK’s work with cold-worked iron were more complex and 
his explanation only tentative, and admittedly unsatisfactory. 

This relaxation peak discovered by SNOEK appears to be characteristic
of all body-centred cubic metals. More recently, KÊ has studied three
peaks in tantalum, corresponding to carbon, nitrogen and oxygen in
interstitial solid solution.

KÊ has elaborated on the theory developed by SNOEK, and has
done experiments, using tantalum, which have extended the empirical
knowledge of the phenomenon considerably. His observations are the
first experimental evidence that oxygen forms an interstitial solid
solution with a metal. KÊ has emphasized the important observation
that the internal friction peak for oxygen cannot be interpreted in
terms of a single relaxation time with a unique activation energy.
The significance of this will be discussed later.

KÊ has proposed a simple picture of this relaxation effect which
may be described with reference to Fig. 1, which represents a
body-centred cubic lattice. If solute atoms occupy the octahedral
interstitial positions in the lattice, then the lattice will be subjected
to tetragonal deformation with the tetragonal axis along one of the ⟨100⟩
directions, depending on which position the solute atom has occupied.
If a tensile stress is applied along one of the ⟨100⟩ directions, then the
solute atom will prefer to occupy a position such that the tetragonal axis
mentioned above will be along the same direction. Thus, under
fluctuating stress, solute atoms will “diffuse” in an attempt to maintain
this state. Under suitable conditions of temperature and applied
frequency this “diffusion” process will absorb large amounts of the
applied vibrational energy.

Fig. 2. Internal friction curves for original tantalum specimen and for
the oxygen-enriched specimen. Both were determined with two
frequencies of vibration (KÊ).

Since all interstitial positions are equivalent with respect to a ⟨111⟩
direction, a tensile stress applied in that direction should produce no
relaxation effect. DIJKSTRA has confirmed this by showing that the
relaxation strength along a ⟨111⟩ direction in single crystals of
alpha-iron is only about 5 per cent of that along a ⟨100⟩ direction.

In his experimental work, KÊ has determined that the oxygen
content in solution in tantalum is greater than 0.06 weight per cent
at 170°C.

Fig. 2 shows the internal friction peaks caused by carbon (0.013 per
cent) in tantalum, determined at two frequencies of vibration.
The higher peaks in the same figure are caused by the composite effect
of carbon and oxygen which diffused into the metal on annealing in an
oxygen atmosphere. The contribution due to oxygen alone was
determined by subtracting the carbon curve value of internal friction
from the composite value at the same temperature. The internal
friction peak due to oxygen alone was found to occur at about 20°C
higher than that due to carbon.
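
The subtraction described above can be sketched as follows. The curve
values are hypothetical numbers chosen only to mimic the reported
behaviour (an oxygen peak about 20°C above the carbon peak); they are
not read from Fig. 2, and the function names are illustrative.

```python
def oxygen_contribution(composite, carbon_only):
    """Subtract the carbon curve from the composite curve, point by point.

    Both arguments are lists of internal friction values taken at the
    same temperatures.
    """
    if len(composite) != len(carbon_only):
        raise ValueError("curves must be sampled at the same temperatures")
    return [total - carbon for total, carbon in zip(composite, carbon_only)]


def peak_temperature(temps_c, values):
    """Return the sampled temperature at which the curve is largest."""
    index = max(range(len(values)), key=lambda i: values[i])
    return temps_c[index]


if __name__ == "__main__":
    # Hypothetical sample points (temperature in deg C, internal friction).
    temps = [130, 150, 170, 190]
    carbon = [0.004, 0.008, 0.005, 0.002]
    composite = [0.006, 0.013, 0.014, 0.007]

    oxygen = oxygen_contribution(composite, carbon)
    shift = peak_temperature(temps, oxygen) - peak_temperature(temps, carbon)
    print("oxygen peak lies", shift, "deg C above the carbon peak")
```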

In Fig. 1 it is shown that there may be two kinds of interstitial
positions for the solute atoms, and these are shown with their
crystallographic indices. Each kind of position shown may take up one of
three types of position, each having the tetragonal axis along one of the
principal axes, as mentioned before. Despite the different kinds of
position which the solute atom may occupy, the mechanism of
preferential distribution will be the same in each case. KÊ has
emphasized that the distortion produced when the solute atom is in the
octahedral position is much higher, and the internal friction peak,
therefore, greater.

Since carbon atoms occupy the octahedral positions in martensite,
it is assumed by KÊ that it is easier for the carbon atom to displace the
two nearest neighbours of the octahedral position than it is to move the
four atoms equidistant from the tetrahedral position. Regardless of the
fact that the octahedral position is smaller than the tetrahedral with
respect to nearest neighbours, this view is held to be correct. Since
tantalum has an atomic radius greater than iron, the available
interstitial space in the tantalum lattice will be larger, and the carbon
atoms can enter the octahedral position. KÊ was able to demonstrate that
the carbon peak at 150°C has a single relaxation time and a unique heat
of activation. This indicates that the carbon is distributed in only one
kind of interstitial position. If this is the tetrahedral position (for the
150°C peak) then there should be another, larger, peak associated with
the octahedral position. This was not found, so it is assumed that the
octahedral position was the preferred location for the carbon atom.
The characteristic internal friction peak for the metal containing
oxygen in solution showed that there is no single relaxation time, and
no unique activation energy. It is assumed from this that the oxygen,
with an atomic radius smaller than carbon, may go into either
interstitial position. KÊ has indicated how the diffusion mechanism from
one position to another will determine whether or not a single relaxation
time is associated with the process.

KÊ also measured the relaxation strengths of tantalum specimens
containing small amounts of carbon and oxygen. Experiments at
temperatures ranging from 350° to 700°K indicated that a linear
relationship exists between the reciprocal of the relaxation strength and
the absolute temperature. Since this plot is a straight line passing
through the origin, it is concluded that the interaction between solute
atoms is negligibly small, and that the critical temperature for
self-induced preferential distribution of solute atoms is close to
absolute zero. This linear relationship is held to be in agreement with
the previously postulated theory of stress-induced preferential
distribution of solute atoms. No spontaneous ordering can occur in the
solutions of carbon and oxygen in tantalum.
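
The reasoning from the straight-line plot can be made explicit with a
small fit. The sketch assumes that the relaxation strength follows
Δ = C/(T − T_c), so that 1/Δ plotted against T is a straight line whose
intercept on the temperature axis gives the critical temperature T_c;
this functional form, the constant C and the data points are
assumptions for illustration and are not taken from the measurements.

```python
def fit_line(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def critical_temperature(temps_k, relaxation_strengths):
    """Temperature at which the fitted line of 1/strength against T hits zero."""
    reciprocals = [1.0 / s for s in relaxation_strengths]
    slope, intercept = fit_line(temps_k, reciprocals)
    return -intercept / slope


if __name__ == "__main__":
    temps = [350.0, 450.0, 550.0, 700.0]
    # Hypothetical strengths obeying strength = C / T, i.e. no interaction.
    strengths = [7.0 / t for t in temps]
    print(critical_temperature(temps, strengths))
```

A line through the origin corresponds to T_c near zero, which is the
situation described for carbon and oxygen in tantalum; a positive
intercept on the temperature axis would instead indicate appreciable
interaction between solute atoms.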

It had been reported in an earlier paper that an internal friction
peak was found to occur in tantalum (with a vibration frequency of
about 1 c.p.s.) at 350°C, as well as those at 150°C and 170°C, the latter
two being associated with carbon and oxygen respectively. KÊ later
showed that the 350°C peak is associated with the presence of nitrogen
in tantalum. It is seen that the temperature of the peak is abnormally
high, and experiments were undertaken to decide whether the
phenomenon was due to stress-induced diffusion, as for carbon and oxygen,
or to viscous behaviour of the boundaries between tantalum and tantalum
nitride. It was found that tantalum, annealed in nitrogen at 1200°C
and quenched from 900°C, had an initially high internal friction at
350°C but this decreased to a much lower value after 20 hours. Similarly
annealed material quenched from 600°C did not have as high an initial
value of internal friction and this also decreased to the same equilibrium
value after 20 hours’ annealing at 350°C. During this time, tantalum
nitride needles were precipitated from the supersaturated solid solution
of nitrogen in tantalum. From this it was concluded that the internal
friction observed could not be due to nitrogen in precipitated form, since
then the internal friction should have increased while precipitation
proceeded.

This precipitation of nitrogen from the solid solution with tantalum,
as indicated by the internal friction measurements, is similar to that
observed by DIJKSTRA in the cases of carbon and nitrogen
precipitating from the solid solutions with alpha-iron.

KÊ found the heat of activation associated with the 350°C absorption 
peak to be about 44,000 cal/mole, which is much larger than the 
values for oxygen in tantalum (29,000 cal/mole) or for nitrogen in 
alpha-iron, which is 20,000 cal/mole. The observed nitrogen-tantalum 
peak could not be interpreted in terms of a single relaxation time with a 
unique heat of activation. Thus the same mechanism of solution is 
held to be true for nitrogen as was postulated for oxygen in tantalum, 
i.e. the nitrogen may be in either the octahedral or tetrahedral positions. 
The extremely high heat of activation, and with it the relatively high 
temperature at which the peak occurs, was not explained. It was 
suggested that these phenomena may be due to a larger distortion in 
the lattice. 
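
The link between a heat of activation and the temperature of a peak can be sketched numerically. The following is only an illustration, assuming a single Debye-type relaxation for which the peak falls where ωτ = 1 and τ = τ0 exp(H/RT). The text itself notes that the nitrogen-tantalum peak does not fit a single relaxation time, so the numbers are indicative at best. The function names and the pre-exponential factor tau0 are assumptions, not quantities given by the authors.

```python
import math

R_CAL = 1.987  # gas constant, cal/(mol K)


def implied_tau0(heat_cal_per_mol, peak_temp_c, freq_cps):
    """Pre-exponential relaxation time implied by a peak at peak_temp_c.

    Assumes the peak condition omega * tau = 1 with
    tau = tau0 * exp(H / (R * T)); a single-relaxation idealisation.
    """
    omega = 2.0 * math.pi * freq_cps
    temp_k = peak_temp_c + 273.15
    return math.exp(-heat_cal_per_mol / (R_CAL * temp_k)) / omega


def peak_temperature_c(heat_cal_per_mol, tau0, freq_cps):
    """Peak temperature (deg C) for a given heat of activation and tau0."""
    omega = 2.0 * math.pi * freq_cps
    temp_k = heat_cal_per_mol / (R_CAL * math.log(1.0 / (omega * tau0)))
    return temp_k - 273.15


# Values quoted in the text: nitrogen in tantalum, 44,000 cal/mole, 350 C, 1 c.p.s.
tau0_n = implied_tau0(44_000, 350.0, 1.0)
# With the same (assumed) tau0, a lower heat of activation gives a lower peak temperature.
t_lower = peak_temperature_c(29_000, tau0_n, 1.0)
print(f"implied tau0 = {tau0_n:.2e} s, peak for 29,000 cal/mole = {t_lower:.0f} C")
```

In such a model a higher heat of activation at fixed tau0 and frequency pushes the peak to a higher temperature, which is the qualitative connection the paragraph above draws.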

KÊ has also done work on iron and found three internal friction 
peaks to occur at 20°, 225° and 490°C. The 20°C peak is believed to be 
of the same origin as that observed by SNOEK. The mechanism causing 
this peak, according to KÊ, SNOEK and also DIJKSTRA, is the anelastic 
effect caused by nitrogen in solution which has been described for the 
case of carbon in tantalum. The 225°C peak which had been observed 
previously by SNOEK and WEST was found by KÊ to be caused by the 
stress-induced diffusion of nitrogen atoms, as in other cases, but 
“within some peculiar type of stress regions created in the specimen 
after it was subjected to cold work.” No account of this mechanism has 
been given but the heat of activation for this diffusion is known to be 
32,000 cal/mole and the diffusion distance is in the order of a few 
atomic diameters. The 490°C peak was found by KÊ to be due to grain 
boundary relaxation in alpha-iron. 

Fig. 3. Effect of heating in air on the grain-boundary relaxation in 
silver. (The steadily rising internal friction which remains after the 
peak at 400°C has disappeared, results from …) 

In a recent review on stress relaxation across interfaces, NOWICK has 
indicated that small amounts of impurities, microscopically undetectable, 
cause blocking of the interface relaxation when lodged in the 
grain boundaries. These impurities may cause partial or complete 
disappearance of the grain boundary internal friction peak. This type 
of peak is not due to the same causes as the one which has been discussed 
in this section thus far. It is suggested that the reader see NOWICK’s 
paper and the references contained therein for a complete picture of 
this phenomenon. Oxygen, probably in the form of oxides, is quite 
effective in decreasing the grain boundary peak in copper and silver. 
Fig. 3 shows how heating polycrystalline silver in air can reduce the 
grain boundary peak almost to zero. A similar effect has been observed 
for copper. The picture of this effect which NOWICK has reiterated is 
that the foreign particles “create new corners at which slip cannot take 
place,” so that the total relaxation may be smaller than in the pure 
substance. According to his hypothesis, “. . . it is to be expected that, 
as impurities are added systematically, the grain boundary peak 
should not only decrease in magnitude but also move toward lower 
temperatures because the effective size of the interfaces is shortened 
and, therefore, becomes smaller.” 

Some of the more recent work on internal friction phenomena due 
to gases in solution has used the internal friction measurements as a 
quantitative indication of the amount of carbon or nitrogen in solution 
in alpha-iron. That the magnitude of the internal friction is strictly 
proportional to the concentration of either the carbon or the nitrogen 
in solid solution, for small solute concentrations, was predicted and 
verified by DIJKSTRA for the case of alpha-iron. In a later paper 
DIJKSTRA used this relationship to follow the precipitation of either 
nitrogen or carbon from solid solution, and also to measure their 
solubility in alpha-iron as a function of temperature. It is pointed out 
that the internal friction at the maximum of the internal friction peak 
($Q^{-1}_{\max}$) has been related to resistivity which has, in turn, been related 
to the solute element concentration. This has shown that $Q^{-1}_{\max}$ is 
nearly equal numerically to the weight per cent concentration of either 
carbon or nitrogen in solid solution. The present writer feels that it is 
not necessary to review DIJKSTRA’s work which is very interesting, 
but slightly beyond the scope of this review. As well as the facts above, 
DIJKSTRA stated also that hydrogen had no effect on the internal friction 
of alpha-iron in the temperature region between −40° and 100°C. 
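
The proportionality described above lends itself to a simple calculation. The sketch below assumes, as the text reports for alpha-iron at small solute concentrations, that the peak height is numerically about equal to the weight per cent of carbon or nitrogen in solution. The function names, the optional background term and the sample readings are illustrative assumptions, not data from the papers cited.

```python
def solute_wt_pct(peak_height, background=0.0):
    """Estimate wt % C or N in solution from the internal friction peak height.

    Uses the rule of thumb quoted in the text (peak height numerically about
    equal to wt % solute); valid only for small concentrations in alpha-iron.
    """
    return max(peak_height - background, 0.0)


def fraction_precipitated(peak_heights, background=0.0):
    """Fraction of the initial solute that has left solution at each reading."""
    initial = solute_wt_pct(peak_heights[0], background)
    if initial == 0.0:
        raise ValueError("initial peak height must exceed the background")
    return [1.0 - solute_wt_pct(q, background) / initial for q in peak_heights]


# Hypothetical peak heights measured during ageing (not experimental data).
readings = [0.020, 0.015, 0.010, 0.005]
print(fraction_precipitated(readings))
```

Because only ratios of peak heights enter the precipitated fraction, the exact constant of proportionality matters less for following precipitation than for measuring absolute solubility.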

Despite the fact that FAST had reported that the nitrogen peak in 
iron becomes broader and appears at a higher temperature (nearer the 
carbon peak) due to the presence of manganese in iron, JOSEFSSON and 
KULA have applied the internal friction peak effect to the study of 
ageing and related phenomena in commercial mild steel. This work has 
confirmed the fact that the presence of impurities, such as manganese 
(and perhaps phosphorus), does change the position of the nitrogen peak 
and broadens the internal friction curve. Also it was confirmed that the 
carbon peak was not changed in position, although it may have been 
broadened by the addition of manganese to the iron. Ageing curves 
(as a function of internal friction) have been given for four steels which 
were each treated in several ways. The removal of nitrogen from solution 
by means of an aluminium addition, and the tendency for nitrogen to 
remain in supersaturated solid solution, even at slow cooling rates, in 
certain steels was also shown. 

Finally, the most recent work which has been published is a brief 
note by MARINGER and MUEHLENKAMP, who have found an internal 
friction peak at 1300°F, and 1 c.p.s., in molybdenum, caused by the 
stress-induced diffusion of nitrogen atoms in solid solution. 

Other effects, similar to those which have just been described, have 
been reported in the literature. KÖSTER studied an unexpected 
decrease in the modulus of elasticity and increase in damping capacity 
with the introduction into several metals of solute atoms of small 
solubility. It was found that the anomalous behaviour was due to small 
amounts of oxygen held in the copper and silver being studied. KÖSTER 
decided that, from knowledge of the nature of damping, the effect can 
be ascribed to stress-induced diffusion of the solute atoms. Later, 
ZENER was able to show that the presence of residual elastic strain 
energy is associated with a reduction of the elastic moduli, in agreement 
with KÖSTER’s observations. It is proposed that, as a solute atom 
passes over the potential energy barrier between two stable interstitial 
positions, the lattice acquires additional strain energy which is 
associated with a lowering of the elastic moduli. 
Two brief summaries of papers 
by Marx! and KorsLer™ add 
to the information available 
on the effect of hydrogen in 
copper. In each case the hydro- 
gen has a marked effect on the 
elastic properties of the metal. 

Fig. 4. Photoelectric activity of zirconium (RENTSCHLER and HENRY). 

Some studies have been carried out on the effect of gases on the 
photoelectric work-function of some metals. When such metals as 
thorium, uranium, calcium, caesium, and barium react with oxygen, the 
photoelectric threshold is shifted toward a longer wavelength. 
RENTSCHLER and HENRY 
first studied the metals just mentioned and later studied zirconium, 
titanium, hafnium, and thorium. In the latter paper it was pointed 
out that when oxygen, for instance, is present as an oxide at the 
grain boundary of a metal, a sputtered film from this metal forms a 
surface from which the true photoelectric threshold characteristic of 
the pure metal may be measured. It is also to be expected that, if the 
gas is in solution in the metal, surfaces sputtered from different batches 
of the metal will yield different values of the photoelectric sensitivity, 
due to variations in the gas content of the different batches. RENT- 
SCHLER and Henry predicted that this would be more a difference in 
relative sensitivity to different wavelengths than in a given threshold 
wavelength. It was, in fact, found that, unlike the other metals men- 
tioned above, different photo-cells of zirconium and titanium responded 
differently to various wavelengths even though the photoelectric 
threshold was the same. 
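
The threshold wavelength and the work function are tied together by the photoelectric relation φ = hc/λ0, so a shift of the threshold toward longer wavelengths corresponds to a lower work function. A minimal sketch of the conversion follows. The function name and the sample wavelengths are illustrative assumptions and are not values reported by RENTSCHLER and HENRY.

```python
HC_EV_NM = 1239.841984  # Planck constant times speed of light, in eV nm


def work_function_ev(threshold_nm):
    """Work function (eV) corresponding to a photoelectric threshold wavelength."""
    if threshold_nm <= 0:
        raise ValueError("threshold wavelength must be positive")
    return HC_EV_NM / threshold_nm


# Illustrative wavelengths only: a longer threshold means a lower work function.
for wavelength in (300.0, 350.0):
    print(f"{wavelength:.0f} nm -> {work_function_ev(wavelength):.2f} eV")
```

The conversion says nothing about the shape of the response curve, which is the quantity the text reports as varying between photo-cells with the same threshold.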

RENTSCHLER and HENRY were able to show the effects of various 
treatments with gas on the photoelectric response of zirconium. When 
a small amount of oxygen reacted with the active metal surface the 
threshold wavelength increased. Fig. 4 shows the photoelectric activity 
of zirconium in different conditions. Curve 1 is the response of pure 
zirconium metal. Curve 2 is the response from a surface sputtered from 
zirconium previously heated to 400°C in hydrogen. Curves 3 and 4 
show the response from surfaces sputtered from zirconium heated in 
nitrogen and oxygen respectively. It was found that the response from 
a surface sputtered from an oxygen-bearing wire does not depend on the 
time of sputtering. Thus the lower work function is independent of the 
film thickness provided the sputtering is done from metal with the gas 
uniformly dissolved throughout the metal. Also the threshold wavelength 
was found to be independent of the oxygen content (provided 
there was some in solution), but the shape of the response curve is 
dependent on the oxygen content. These facts were interpreted to 
mean that the film consists partly of pure metal with no gas, and partly 
of metal containing gas in solution, the former having a low threshold 
and the latter a higher threshold wavelength. 

Fig. 5. Photoelectric activity of titanium (RENTSCHLER and HENRY). 

Fig. 5 shows the same type of thing for titanium where the effect of 
the dissolved gases is even more marked than for zirconium. Curve 1 
represents the photoelectric response for pure titanium; Curve 2 is 
the response for a solution of nitrogen in titanium; and Curve 3 the 
response for oxygen-bearing titanium. 

In an interesting, but only qualitative experiment, RENTSCHLER and 
Henry found that titanium with oxygen in solution is not as volatile 
as the pure metal. This effect has also been found with zirconium. 

It was found that thorium behaves like zirconium and titanium but 
only in so far as the gas increases the threshold wavelength. Both 
oxygen- and nitrogen-treated surfaces have a response similar to that of 
Curve 2, Fig. 4, i.e. a response lower, at low wavelength, than that of the 
pure metal. Since the thorium behaved like zirconium and titanium 
where the gas has reacted only with the surface of the metal, it was 
assumed that the gas does not dissolve as readily throughout the 
thorium. No explanation was given of the reason for this effect. 

Recently, ANDERSON has studied the work functions and ageing 
characteristics of copper. The work functions of fourteen surfaces 
were measured by means of contact differences of potential with respect 
to barium surfaces of a known work function. The surfaces were pre- 
pared from spectroscopically pure copper which had been subjected to 
40 vacuum fusions and fractional distillations. The result of some 200 
measurements showed that the first films to be deposited were not 
reproducible, and drifted rapidly with time to a lower work function. 
Later deposits were more reproducible, and the rate and magnitude of 
drift decreased. For the specimens 9-14 there was good reproducibility 
and no drift. At no time was there any drift to a higher work function. 
The variability is due to dissolved gases which are retained by the 
copper despite vacuum fusions, and are reabsorbed into the condensed 
film. Since the gas, presumed to be oxygen, lowers the work function, it 
is concluded that the gas is in solution and not merely adsorbed on the 
copper. This conclusion is reached since oxygen adsorbed on a clean 
tungsten surface raises the work function, and copper would be expected 
to behave similarly if it had an adsorbed gas film. However, it must be 
emphasized that there was no proof of the presence of oxygen in the 
copper. FAST agrees that the adsorption of oxygen causes an increase 
in the work function. 

A number of papers on the influence of hydrogen on photoelectric 
sensitivity have been listed by SMITH. Except for those reported in 
this paper there appears to be no recent work on this subject. 


Lattice Expansion 
Recent work on the lattice expansion of metals due to the presence of 
dissolved gas in the lattice appears to be almost non-existent. A 
relatively recent paper, which has some bearing on the subject, is by 
HUME-ROTHERY and ANDREWS. In studying the lattice spacing and 
thermal expansion of copper, they accidentally oxidized some copper 
at 771°C. This metal was cooled to room temperature, reheated to 
870°C for one hour and again cooled to room temperature. X-ray 
analysis of this showed oxide lines, but the copper lattice spacing was 
the same as that of pure copper. At higher temperatures the lattice 
spacing of the oxidized metal was only slightly higher than that of the 
pure metal, the difference at 770°C being 0.0004 Å. No attempt was 
made to examine equilibrium conditions. 

A large part of the available information on metals which dissolve only 
small amounts of gas is that for electrodeposited chromium. HÜTTIG 
and BRODKORB report that it has a body-centred cubic lattice which 
has a larger parameter than that of chromium which has been melted, 
and this was taken to indicate the entry of hydrogen into solid solution. 
WOOD later found, along with a marked line broadening, a very slight 
displacement of the lines of a diffraction pattern. He concluded that 
a small part of the hydrogen in chromium is in solid solution, but that 
the major portion is adsorbed on the walls of ‘crystallites’ which cause 
the diffuseness of the diffraction lines. HUME-ROTHERY and WYLLIE 
confirmed the formation of a body-centred cubic structure, and recog- 
nized the possibility of “abnormal lattice spacings due to co-deposited 
hydrogen” but did not study the effect in detail. 

Recent work has been done on electrolytic chromium by POTTER and 
LUKENS, who measured the effect of hydrogen on the lattice parameter. 
As in other cases, the electrodeposited metal gave a normal body-centred 
cubic diffraction pattern with the lattice parameter somewhat enlarged 
compared to gas-free metal, and with a broadening of the lines. This 
was taken to indicate an enlargement of the lattice with variations in 
the lattice parameter of 0.3 to 0.4 per cent, causing the line broadening. 
On degassing at 500°C, the parameter decreased somewhat and the lines 
were less diffuse. The sample degassed at 1000°C showed a normal lattice 
with sharp lines of almost the correct spacing. Similar effects had been 
observed with manganese on degassing at 300°C and 500°C, 
respectively. 

In discussing this paper, SNAVELY briefly described a theory which 
he had developed from experimental evidence reported in another 
recent publication. It was suggested that unstable chromium 
hydrides were deposited, with the hydrogen atom in interstitial positions 
forming either a wurtzite- or fluorite-type structure with the chromium, 
depending on the hydrogen content. Since these hydrides are unstable 
at ordinary temperatures they dissociate rapidly after deposition to 
form atomic hydrogen and body-centred cubic chromium. A large 
volume change (15 per cent) accompanying this reaction is believed to 
be related to the hardness, brittleness, and cracking characteristics of 
this type of electrodeposit. “Microfissures” are formed during the 
volume change and the hydrogen, which has a very low solubility in the 
chromium lattice, is “occluded” in the fissures. These fissures are 
apparently the cause of the line broadening in the diffraction patterns. 
POTTER and LUKENS had suggested the possibility of a supersaturated 
interstitial solution of hydrogen in chromium. However, SNAVELY 
pointed out that the body-centred cubic chromium lattice has interstices 
which are about 1.58 Å in least diameter, while the hydrogen atom is 
less than 1 Å in diameter. SNAVELY considered that “an interstitial 
atom must be sufficiently large to contact all surrounding atoms if a 
stable or even metastable interstitial solid solution” is to form, and, 
therefore, the possibility of chromium absorbing hydrogen is small. 

SMITH reports work by two investigators who found lattice disten- 
sions in iron and nickel which had been electrodeposited from solutions 
containing arsenic. SMITH claims that the observed increase in para- 
meter, accompanied also by line broadening, is due to the arsenic. 

It must be appreciated that the gas content of an electrodeposited 
metal is not an equilibrium gas content, and may cause strains not 
present in an annealed metal. It seems that no lattice parameter 
increase has been found in those metals which dissolve only small 
amounts of hydrogen when in equilibrium with the gas phase. This is 
really to be expected if one considers the size of the hydrogen atom com- 
pared to the relatively large interstitial space available for the small 
number of hydrogen atoms absorbed. However, SMITH uses this 
absence of true lattice expansion as an argument in favour of his “rift” 
hypothesis. 

POTTER and HUBER have studied the effect of hydrogen on the 
lattice parameter of electrolytic manganese. It was found that the 
manganese lattice is distended, the average lattice parameter enlarge- 
ment being about 0.0003 per cent per c.c. of gas per 100 g of metal. A 
widening of the lines in the x-ray diffraction patterns of gas-bearing 
specimens indicated a deviation from the average parameter caused by 
non-uniform distortion of the lattice. This non-uniform lattice was 
attributed to non-uniform gas distribution. The effect of various 
degassing treatments on the metal indicated that the parameters were 
not affected unless the metal is heated under a reduced pressure at over 
300°C. At 500°C and over, the lattice returned to its normal size and 
the formerly broadened diffraction lines became sharp. It is pointed out 
that although the hydrogen in solution is only 0.02 per cent by weight, 
under normal conditions it would occupy a volume twenty times that of 
the metal containing it. The authors state further: “It is not unreason- 
able to expect that this highly compressed gas would have some effect 
on the structure of the metal containing it.” The concept of this 
dissolved gas being merely “highly compressed gas” is hardly in keeping 
with other information. The non-uniformity in the lattice parameters 
mentioned above is not proportional to the amount of gas in the metal. 
The deviation from the mean lattice parameter (line broadening) is 
zero at zero gas content, high at 95 c.c. per 100 g of metal, and decreases 
with increasing gas content up to the observed gas content of 615 c.c. 
per 100 g of metal. It is shown that this indicates that the deviation 
might again be zero at a value of about 695 c.c. hydrogen per 100 g 
manganese, when the metal unit cell would contain exactly 2 gas atoms. 
Since there was some diffuseness in the lines at 350 c.c. of hydrogen 
per 100 g of manganese it was concluded that the average distribution of 
1 atom of gas per unit cell was not uniform and that the cells could 
take two atoms each. 
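
The gas contents quoted above can be checked with a little arithmetic. The sketch below converts a number of hydrogen atoms per unit cell into c.c. of hydrogen gas (at standard temperature and pressure) per 100 g of manganese. It assumes the 58-atom cubic unit cell of alpha-manganese and an ideal molar gas volume of 22,414 c.c.; neither figure is stated in the text, and the function name is illustrative.

```python
MN_ATOMIC_WEIGHT = 54.938      # g/mol
MOLAR_VOLUME_CC = 22_414.0     # c.c. per mole of ideal gas at 0 C, 1 atm
ATOMS_PER_CELL = 58            # assumption: alpha-manganese unit cell


def gas_cc_per_100g(h_atoms_per_cell, atoms_per_cell=ATOMS_PER_CELL):
    """c.c. of H2 (S.T.P.) per 100 g Mn for a given number of H atoms per cell."""
    mol_mn = 100.0 / MN_ATOMIC_WEIGHT
    mol_cells = mol_mn / atoms_per_cell
    mol_h2 = 0.5 * h_atoms_per_cell * mol_cells  # two H atoms per H2 molecule
    return mol_h2 * MOLAR_VOLUME_CC


for n in (1, 2):
    print(f"{n} H atom(s) per cell -> {gas_cc_per_100g(n):.0f} c.c. per 100 g")
```

Under these assumptions one and two atoms per cell come out near 350 and 700 c.c. per 100 g, close to the 350 and 695 c.c. figures discussed in the text.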

In another paper it is inferred on the basis of related phenomena 
described in the literature, that, owing to the absorption of hydrogen 
by nickel, systematic lattice defects arise in the electrolytically deposited 
metal. These defects amount locally to about a 13 per cent change in 
the lattice constant. These local lattice distortions cause the fine- 
grained structure and hardness of the deposit. In another experimental 
study it is shown that hydrogen participates directly in the formation 
of the structure of the electrodeposit of nickel. One other abstract 
refers to a paper which describes an experimental spectrographic 
study of the state of hydrogen in a metal, the action of x-rays and the 
structure of the lines of the $K_{\alpha_1,\alpha_2}$ doublet. It is claimed that hydrogen 
in the form of protons is a factor in determining the structure of the 
doublet and also the properties of the material. 

Considering those metals which dissolve larger amounts of hydrogen, 
SMITH in various parts of his monograph discusses the effect of 
hydrogen on the metal lattice. Most of the experimental work has been 
done on palladium, but information is given for other metals which 
absorb hydrogen readily. Representative of the studies on palladium 
is the work of OWEN and JONES which will be summarized here. 
By making measurements of the palladium lattice parameter under 
isothermal conditions and at varying pressures, the hysteresis effect 
when the pressure of the gas is taken through a cycle was determined. 
Fig. 6 shows the results which were obtained at 100°C. Those parts of 
the isothermals in which only a single component existed could be 
reproduced by simply altering the pressure. This reproducibility 
disappeared, however, when a second phase was present. 

It was found that the hydrogen solubility changed rapidly when the 
beta phase appeared or disappeared. The measurements made are 
interpreted as meaning that the system consists of two solid solutions 
at the temperatures considered, namely 100° and 120°C. The hydrogen- 
rich beta phase is closely associated with the combination Pd₂H which 
is capable of dissolving more hydrogen. The beta phase was always 
distorted no matter what precautions were taken to obtain it without 
distortion. It had been suggested earlier that this phase had an 
orthorhombic lattice, but this was not confirmed. It was found too that 
the distortion of the lattice remained after removal of the hydrogen. 
OWEN and JONES state that, “The presence of the hydrogen, even 
though it enters the palladium lattice interstitially, causes a permanent 
displacement of the palladium atoms which if the concentration is high 
enough, cannot be removed without subjecting the material to a 
temperature well above that required to remove the hydrogen from the 
metal. This would suggest that at certain concentrations the gas may 
not be in its normal state when it leaves the metal.” 

Fig. 6. Effect of hydrogen pressure on lattice parameter of palladium 
(OWEN and JONES). 

Recently CLARK has reported a new determination of the lattice 
constants of alpha (hexagonal close-packed) titanium. Comparing his 
results with earlier work, CLARK found agreement with FAST’s work but 
not with others. CLARK is able to show how the c₀ value is markedly 
increased by oxygen and nitrogen which form interstitial solid solutions, 
while the a₀ value remains essentially unchanged. Since the only points 
of disagreement with earlier work were in the c₀ values, CLARK concluded 
that the other materials tested must have been contaminated with 
oxygen or nitrogen. Fig. 7 shows CLARK’s results indicating the effect 
of the two gases on the lattice. 

BUSK and BOBALEK found a similar effect in the case of a 
magnesium + 10 per cent aluminium alloy charged with hydrogen in 
an alkaline solution. In this case both parameters had increased 
significantly, and it was concluded that the hydrogen had dissolved 
interstitially. 

The most recent discussion on the lattice parameter has been pre- 
sented by SMITH. X-ray evidence of so-called ‘rifts,’ or widened 
lattice spacings in work-hardened metal, as compared with the relatively 
continuous and uniform lattice of fully annealed metal is discussed in 
connexion with hydrogen solubility. SMITH claims that the charac- 
teristic line-broadening associated with the absorption of hydrogen is 
due to widened lattice spacings, caused initially by plastic deformation, 
but which may be extended by hydrogen absorption. The development, 
manifestations, and effects of the “rifts” are described in this paper. 

Fig. 7. The effect of the addition of small amounts of oxygen and 
nitrogen on the lattice parameter of alpha titanium (CLARK). 


Studies of the relationships between lattice parameter, hardness, 
resistivity, and volume changes have been carried on for a number of 
years by a French group. PORTEVIN, CHAUDRON and MOREAU 
showed that the electrodeposited metals are much harder, have lower 
conductivity and have broader x-ray diffraction lines than annealed 
metals. By using the cold bombardment method for degassing, it was 
shown that the removal of hydrogen from electrodeposited iron altered 
neither the hardness, the lattice parameter, nor the diffuseness of the 
lines. Annealing after cold degassing caused no appreciable removal of 
gas, but did cause a decrease in parameter, a sharpening of the x-ray 
lines, and the expected softening of the metal. A special type of cold- 
working by the hydrogen atoms was postulated, the metal remaining in 
a strained condition after removal of the gas. Cathodically charged 
Armco iron was found to have a decreased tensile elongation and 
increased resistivity. Similar changes were found in the properties of 
aluminium after the same treatment. 

In studying the palladium-hydrogen system, MICHEL found that, 
on changing the composition from pure Pd to Pd₂H by the addition of 
hydrogen, the Brinell hardness increased from 47 to 170, while the 
lattice parameter increased from 3.882 to 3.885 Å, the hydrogen atoms 
occupying the body-centred positions in a face-centred cubic palladium 
lattice. The beta phase, PdH, was found to have a Brinell hardness of 
only 70. On heating PdH at 310°C, hydrogen was slowly evolved and the 
hardness rose to 170 Brinell after about 2 hours. After that time the 
alloy with a composition corresponding to Pd₂H had a hardness which 
slowly fell to a value limited by the annealing temperature. When the 
PdH was degassed by cold bombardment, the lattice remained distorted 
and the hardness was unaffected. 

The results which had previously been obtained for iron and alumi- 
nium were reproduced in the case of palladium and tantalum by 
CHAUDRON, PORTEVIN and MOREAU. The results obtained for palla- 
dium were as follows: 








Composition                                   Ratio of        Lattice         Brinell 
                                              Resistivities   Parameter (Å)   Hardness 

Pure annealed Pd                              1               3.882           45 
Beta phase                                    1.69            4.020           77 
Beta phase degassed (ionic bombardment)       1.69            4.020           77 
Beta phase degassed and annealed at 600°C     1               3.882           45 
Alpha phase                                   1.45            3.885           170 
Alpha phase degassed (ionic bombardment)      1.45            3.885           170 
Alpha phase degassed and annealed at 600°C    1               3.882           45 
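
The lattice parameters in the table translate directly into volume changes for a cubic lattice, since the cell volume varies as the cube of the parameter. The following sketch works this out for the tabulated values. The function name is an illustrative assumption, and the calculation ignores the distortion of the beta phase mentioned elsewhere in the text.

```python
def volume_expansion_pct(a_new, a_ref):
    """Percentage change in unit-cell volume of a cubic lattice."""
    return ((a_new / a_ref) ** 3 - 1.0) * 100.0


A_PURE_PD = 3.882  # lattice parameter of pure annealed Pd from the table
for phase, a in (("alpha", 3.885), ("beta", 4.020)):
    print(f"{phase} phase: {volume_expansion_pct(a, A_PURE_PD):.2f} per cent")
```

On these figures the beta phase is roughly 11 per cent larger in volume than pure palladium, while the alpha solution differs by only a fraction of a per cent.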














MICHEL, BÉNARD and CHAUDRON followed the changes of the 
lattice parameter and diffuseness of the x-ray diffraction lines as 
palladium was electrolytically charged with hydrogen. Since the beta 
phase appeared as soon as hydrogenation started, and from subsequent 
observations, it was concluded that a superficial layer of the beta phase 
is formed and the hydrogen diffuses from this phase into the metal to 
form the alpha solution. When the inner portion of the metal has 
reached a composition corresponding to Pd₂H, then the beta phase 
starts to advance into the body of the metal by diffusion of an excess of 
hydrogen over that amount required for the alpha phase. On exposure 
to air, hydrogen was found to leave the beta phase, the lattice para- 
meter decreasing to a minimum value of 4.01 Å (at room temperature). 

Finally MICHEL carried out a thermal investigation on the hydrogen- 
palladium system by dilatometric and differential analysis. The 
results obtained were interpreted as being in agreement with the 
results of the previous x-ray analysis. The diagrams in Fig. 8, taken 
from MICHEL’s paper, show the dilatometric curves for both the alpha 
and beta phases. On heating in nitrogen, the alpha phase starts to 
contract at about 200°C, and the beta phase starts at about 100°C. 
In each case the contraction is complete at 300°C. This is shown in 
parts a and b of Fig. 8. The differential thermal analysis of the beta 
phase, heated in nitrogen, is shown in part d. Starting at room tempera- 
ture there is an absorption of heat due to the removal of hydrogen, 
and this is followed by a marked exothermic effect due to the contrac- 
tion of the lattice at about 300°C. The same experiment repeated on a 
specimen which has been degassed by ionic bombardment will yield 
the curve shown in part e. The exothermic effect persists, but no heat 
is required to remove the hydrogen. 

Fig. 8. Thermal and dilatometric relationships in the palladium-hydrogen 
system (MICHEL). 

PORTEVIN briefly expressed the same ideas that the distortion of 
the crystal lattice persists after degassing, because of a “cold-working” 
effect by the gas atoms. The temporary introduction of gas atoms may 
be used to modify the resistivity of the metal. 


Magnetic Properties 

Although the present author has found no report of any effect of 
oxygen or nitrogen on the magnetic properties of metals, the effect of 
hydrogen is well established for palladium and iron. It has been found 
that the strong paramagnetism of palladium decreases, on absorbing 
hydrogen, from the value of the susceptibility for pure palladium to a 
final value which is reached at a concentration of H/Pd = 0.55 to 0.66. 
The susceptibility remains approximately the same at concentrations 
higher than the rather uncertain limiting composition. It should be 
noted that the decrease in susceptibility is not due to the formation of 
the beta phase, since the addition of a small amount of hydrogen will 
lower the susceptibility prior to any possible beta formation. SMITH 
has reported several explanations of this effect, the simplest probably 
being that of MOTT and JONES who concluded that the hydrogen 
electron goes into the d shell, thereby decreasing the number of vacancies 
in the palladium d shell and hence the paramagnetism. Since the palla- 
dium atom has 0.55 holes per atom in the d shell, and since it had been 
shown that the magnetic susceptibility of the palladium was reduced to 
zero by the addition of about 0.55 hydrogen atoms to each palladium 
atom, MOTT and JONES also concluded that the dissolved hydrogen 
atom must be completely ionized. Since some investigators report the 
H/Pd ratio to be as high as 0.66 when the loss of paramagnetism is com- 
plete, SMITH concludes that one-sixth of the hydrogen atoms remain 
uncombined and are contained “either in the interstices of the distended 
lattice of the beta phase or outside the lattice proper in the rifts. . . .” 
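
The hole-filling argument and the one-sixth figure can be put in numbers. The sketch below treats the susceptibility as falling linearly to zero as the d-shell holes are filled, which is a simplification of the picture attributed to MOTT and JONES, and then reproduces the arithmetic behind SMITH’s fraction. The function names and the unit starting susceptibility are illustrative assumptions.

```python
D_HOLES_PER_PD = 0.55  # holes per palladium atom in the d shell (from the text)


def relative_susceptibility(h_per_pd, holes=D_HOLES_PER_PD):
    """Susceptibility relative to pure Pd, assuming one hole filled per H atom."""
    return max(1.0 - h_per_pd / holes, 0.0)


def uncombined_fraction(h_per_pd_at_zero, holes=D_HOLES_PER_PD):
    """Fraction of hydrogen atoms not needed to fill the d-shell holes."""
    return (h_per_pd_at_zero - holes) / h_per_pd_at_zero


print(relative_susceptibility(0.275))   # half the holes filled
print(uncombined_fraction(0.66))        # about one-sixth
```

With 0.66 hydrogen atoms per palladium atom at the point where the paramagnetism vanishes, 0.11 of them are surplus to the 0.55 holes, and 0.11/0.66 is the one-sixth quoted.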

SIEVERTS and DANZ have shown that the effect of deuterium on the 
magnetic susceptibility of palladium is exactly the same as that of 
hydrogen. Thus it may be assumed that the deuterium atom also yields 
its electron to the d shell of palladium thereby rendering it diamagnetic. 

VOGT did experiments which indicate at least roughly that the 
addition of gold to a palladium-hydrogen system causes a decrease in 
the magnetic susceptibility similar to that brought about by the 
hydrogen. This lowering of the susceptibility is complete at 50 to 60 
atomic per cent gold, and at this composition the hydrogen was found 
to be insoluble. VOGT concluded that gold and hydrogen have equal 
effects on the paramagnetism of palladium. 

The most recent information which the writer has been able to find 
in the literature is a paper by FITZWILLIAM, KAUFMANN and SQUIRE, 
who studied the effect of hydrogen on the magnetic properties of 
titanium and zirconium. An interesting point is that, on absorbing 
hydrogen, the paramagnetic susceptibility of titanium increased. The 
reason for this was not explained. Only one composition (1.2 atoms of 
hydrogen per atom of titanium) was apparently considered. In the 
original zirconium specimen, an iron content of 0.11 per cent played 
no part in the magnetic susceptibility except at liquid hydrogen 
temperatures. However, on addition of hydrogen and with suitable heat 
treatment, “the iron appeared to precipitate out of solution with the 
zirconium in that it gave a ferromagnetic component to the magnetiza- 
tion.” It is believed that the iron must have actually formed clusters in 
order to cause the observed effects. However, the authors were able to 
separate the effect due to hydrogen alone, and that due to iron, on the 
susceptibility and they have presented a curve showing the paramag- 
netic susceptibility of zirconium as a function of hydrogen concentra- 
tion. The curve is similar to that for the palladium-hydrogen system but 
differs in that, instead of being a linear decrease, the curve is concave 
toward the axes, and a constant value of susceptibility is reached at a 
positive, rather than a diamagnetic, value as shown in Fig. 9. It is 
interesting to note that the Curie point of the precipitated iron was 
found at 585°K, instead of near 1000°K, as in the lattice of pure iron. 
Other experiments relating structure, temperature, hydrogen content, 
and susceptibility were also described. 

Fig. 9. Magnetic susceptibility of the system Zr-H as a 
function of hydrogen concentration at 295°K (FITZWILLIAM, 
KAUFMANN and SQUIRE). Abscissa: atoms of H per atom Zr. 

The effect of interatomic distances on susceptibility has been applied 
by MICHEL in an attempt to explain the magnetic behaviour of the 
palladium-hydrogen system. In 1939 MICHEL and GALLISSOT, using 
the cold-bombardment method of degassing developed by CHAUDRON 
and MOREAU, determined the respective effects of hydrogen and inter- 
atomic distance in the variation of the susceptibility of palladium. 
The experimental results are given in Table 3. These 
results seem to show that the magnetic susceptibility is unaffected by 
degassing. The authors attribute the large variations in susceptibility 
to the modification of the interatomic distances by the dissolved 
hydrogen. 

TABLE 3 

Columns: Composition; magnetic susceptibility; R/R0; Brinell hardness. 
Rows: Pure Pd; Pd2H (alpha Pd); Pd2H degassed by bombardment; 
PdH (beta Pd); PdH degassed; Pd2H or PdH degassed and then annealed. 
R = electrical resistance; R0 = resistance of pure Pd. 

The effect of hydrogen on the magnetic properties of those metals 
which absorb only small amounts of the gas is not so well explained as 
in the systems just discussed. SMITH has reviewed the effects of 
hydrogen on the magnetic properties of iron and manganese, and 
concludes that the influence of hydrogen is probably due to its effect in 
cold-working the metal and also its purifying effect. It is indicated* that 
the susceptibility (and permeability) of iron at low field strength (about 
10 oersteds) increases with decreasing hydrogen content, in much the 
same way as in the palladium-hydrogen system. 

Extremely high permeabilities have been reported for iron annealed 
near 1500°C in wet hydrogen for 30 minutes, and then cooled rapidly to 
a temperature just below the A,-point of pure iron, and held there for 
some time. The author attributed the high permeability to the high 
purity of the metal, due to the hydrogen treatment, and not to any 
residual hydrogen content. SMITH claims that the effect is due to the 
annealing only, the atoms (of metal) being “no longer in the altered 
states in which they presumably exist in the vicinity of a rift.” WATT 
confirmed the improvement in the magnetic properties of iron by the 
hydrogen treatment, and found similar improvements in the case of 
cobalt and nickel. 

WHEELER found that the absorption of hydrogen by manganese 
lowered the susceptibility of alpha-manganese at 450°C, and beta- 
manganese at 850°C, but above 1200°C raised that of face-centred 
tetragonal or molten Mn. At no time was the metal reported to 
become ferromagnetic. 

SMITH concludes that the modification of the magnetic properties of 
those metals which do not dissolve large amounts of hydrogen must be 
due to some cold-working of the metal. At the same time it is recog- 
nized that a similar cold-working should take place in Pd, and this 
would be expected to alter the linear dependence so well explained by 
the electron theory of metals. This anomaly does not seem to have 
been explained. 

Finally, SMITH has attempted to show a reciprocal relationship 
between paramagnetic properties and capacity to absorb hydrogen. It 
is stated that all highly magnetic metals absorb measurable amounts 
of hydrogen, while elements with χ less than 0.8 × 10⁻⁶ do not, with 
five exceptions, absorb hydrogen. SMITH has proposed that the relation 
between paramagnetism and hydrogen content may be due to some 
process which precedes a final arrangement. This process is postulated 
to be the “penetration of the metal”, a factor determined by “the 
rift-state.” It is suggested that the “rifts” cause “changed electronic 
configurations” which in turn alter the susceptibility. Earlier, in 
explaining the sudden decrease in the magnetic permeability of iron 
after being cathodically charged with hydrogen for some time, 
SMITH said: 

“Since saturation is that concentration of a phase which is in equili- 
brium with a certain pressure, we shall be making little more than a 
change of terminology if we suppose the critical stage to be that at 
which the pressure of the rift hydrogen, steadily building up toward 
the high effective pressure of electrolysis at high current density, 
reaches a value at which the lattice lets go and sudden enlargement of 
the whole rift system takes place, the resulting increase in inter- 
atomic spacings causing alterations of electron configurations in the 
atoms, which are, in turn, perhaps responsible for the changes in 
magnetic properties.” 

The present writer finds it very difficult to visualize a metallic 
lattice, in the so-called elastic range, “letting go”. 

SEEMAN repeated some experiments done earlier by SHIMIZU on 
the effect of gas on the susceptibility and electrical conductivity of 
gold-silver alloys before and after heating in vacuo, heating in nitrogen, 
and remelting in vacuo. The above treatments had no effect on the 
magnetic properties of either the alloys or the pure metals. SEEMAN 
stated that the effect of oxygen on the susceptibility of diamagnetic 
and paramagnetic alloys is still not accurately known, but for precise 
determinations complete degassing of the specimens was recommended. 

MAUN has stated that dissolved gas has a marked influence on the 
value of the magnetism and, therefore, the alloys (the thallium-lead 
system was used as an illustration) must be rigorously annealed in 
vacuo after each measurement. 


Electrical Resistivity 

The present writer has been able to find very little information des- 
cribing the effect of dissolved oxygen or nitrogen on the electrical 
resistivity of a pure metal. For those metals which dissolve only small 
amounts of hydrogen there appears to be no recent experimental 
evidence to show a certain effect. SMITH has briefly reviewed the 
literature available on this subject up to 1946 and the reader is referred 
to SMITH’s supplementary notes (pp. 258, 262, 265, 267, 268, 275) for 
other references. Although the evidence seems to be rather uncertain, 
and the experimental work was done many years ago, it seems that the 
resistivity of iron will increase a small amount on absorbing hydrogen 
from the gas phase. If the iron is charged with hydrogen cathodically 
then the resistance will increase up to a limiting value of hydrogen content 
and thereafter start to decrease. This effect is similar to that which 
occurs when palladium absorbs hydrogen, and will be described later. 
When nickel absorbs hydrogen from the gas phase, experimental results 
indicate any one of the following effects: (a) an increase, (b) a decrease, 
and (c) no effect on the resistivity. When the nickel is cathodically 
charged with hydrogen the effect is also doubtful. One report has 
indicated a decrease in resistivity due to “protonic conduction.” The 
same remarks as have been made for nickel seem to apply also to the 
case of platinum. No effect of any kind on the resistivity of copper and 
silver has been found due to hydrogen. 

As in the case of all other physical properties of the palladium- 
hydrogen system, the electrical resistivity seems to have been studied 
quite thoroughly. As in the case of other properties, SMITH has 
used the experimental evidence in the case of palladium-hydrogen to 
support his view that a large proportion of the hydrogen present in 
palladium is concentrated in “rifts.” His arguments will not be 
presented here, but in preference to them the writer will review some 
recent work by WRIGHT which is in agreement with the earlier x-ray 
work discussed by OWEN. It is pointed out that the exceptional 
properties of the transition metals are due to the partly filled d shell. 
It was stated in the discussion of magnetic susceptibility that palladium 
has approximately 0.6 unoccupied states per atom in the d band. 
Generally it is accepted that the hydrogen dissolved in palladium is 
almost completely ionized, and its electrons enter this partially filled 
d band. Thus the palladium d shell is filled when the H/Pd ratio is 0.6. 
This seems to be true for palladium in equilibrium with the gas phase, 
but SMITH has shown several cases where the H/Pd ratio is higher than 
0.6 when the palladium is electrolytically charged with hydrogen. 
This has been explained by SMITH to be due to hydrogen being held in 
the widened lattice spacings of the metal lattice. WAGNER, in calcu- 
lating the hydrogen solubility from thermodynamic principles, has 
assumed that this excess hydrogen is located interstitially in solid 
solution, the electrons entering the s band of the palladium. WRIGHT 
has presented some theoretical ideas proposed by LACHER and WAGNER 
to account for the form of the concentration-pressure isotherms of 
hydrogen in palladium. It was concluded by WAGNER that the existence 
of a hysteresis loop is due to a failure to establish thermodynamic 
equilibrium. The relatively low temperatures of observation allowed 
only restricted exchange of palladium atoms between the two phases of 
the palladium-hydrogen system. Experimental studies of the hysteresis 
effect in the palladium-hydrogen system indicated that reversible 
isotherms are obtainable only by using palladium in a finely divided state. 
Quite contrary to the definite opinions of SMITH, WRIGHT has 
interpreted the work of SIEVERTS and his co-workers, and 
of OWEN and his co-workers, to indicate that “the mixed 
phase or (alpha + beta) regions within the hysteresis loop are charac- 
teristic of a system in which the thermodynamic equilibrium is not 
completely established. The x-ray studies also indicate that the new 
phase, alpha or beta, exists finely dispersed throughout the old phase, 
the establishment of a new phase occurring, presumably, by the process 
of nucleation and growth.” LACHER’s formula, which explains the 
hysteresis effect, is based on assumptions which are supported by the 
above conclusions. SMITH has claimed these assumptions to be invalid 
in the light of experimental evidence which appears to indicate that 
the two phases of the palladium-hydrogen [...] does exist over a small 
range of pressure. 

Fig. 10. Resistance-pressure isotherms for the palladium-hydrogen 
system (WRIGHT). 

WRIGHT determined resistance isotherms at several 
temperatures, allowing sufficient time after each pressure change for 
the resistance to become almost constant. The results of his measure- 
ments are shown in Fig. 10. It was shown that the limits of the 
hysteresis loops were quite close to the corresponding limits for the 
occlusion isotherms. The transitions from one phase to another have 
been described on the basis of x-ray, solubility, and electrical resis- 
tance measurements. If one accepts that the nuclei of a new phase 
are dispersed throughout the old phase, then the hysteresis effect may 
be explained by the contribution of the surface between the two 
phases to the free energy of the system. WRIGHT has described this 
mechanism: 

“The vapour pressure of the dispersed beta-phase particles would be 
greater than the pressure they would exert in bulk, hence they could 
not form without some degree of supersaturation in the alpha phase. 
Similarly during evolution of hydrogen, an undersaturated beta phase 
would be necessary for the formation of alpha-phase nuclei. Over a 
certain range of pressure up to the point where the isotherms rise (or 
fall) sharply, it must be assumed that the two phases co-exist in a 
state of metastable equilibrium. Beyond this stage, the degree of 
supersaturation in the alpha phase (or undersaturation in the beta 
phase) is such that the system becomes unstable and the new phase is 
rapidly formed in fairly large blocks out of the finely dispersed 
particles.” 

Fig. 11. Concentration-pressure isotherms at 100°C. 
Experimental: SIEVERTS and DANZ. Theoretical: I. LACHER, equilibrium; 
II. WAGNER, path followed between upper and lower isotherms (WRIGHT). 

The role of the hydrogen in each of the stages of the 100°C hysteresis 
loop shown in Fig. 10 has been clearly outlined by WRIGHT. The 
hydrogen in solid solution in the lattice will ionize, and the proton or 
hydrogen ion will increase the resistance by disturbing the periodicity 
of the lattice, while the electron will enter the unfilled d band of the 
palladium. These are opposing effects, the former tending to increase 
the resistivity, and the latter to decrease the resistivity. From the 
close correspondence between the concentration isotherm (Fig. 11) and 
the resistance isotherm, WRIGHT has deduced that the scattering 
centres provided by the hydrogen ions have a greater effect and, there- 
fore, increase the resistance of the palladium-hydrogen system. It was 
also indicated that the increase in resistance due to cold working is 
small compared with this alloying effect, so that lattice distortion at the 
phase boundaries will be relatively unimportant in altering the resis- 
tance. Finally, the effect on the resistivity of a new phase in the system 
will be greatest when the phase particles are of comparable dimensions 
to the wavelength of the conduction electrons. 

In the first stage of the resistance-pressure isotherm at 100°C, the 
relatively greater effect due to decreased periodicity of the lattice 
caused by the hydrogen ions entering the metal, over the effect of the 
electrons entering the d shell, will result in an increase in the resistivity. 
When the beta phase starts to form at about 27 cm Hg, there is a slight 
increase of resistance due to strains at the phase boundaries. At about 
31 cm Hg, the system transforms completely to the beta phase. If this 
transformation were brought about by very small increases in hydrogen 
pressure with alternate long time periods allowed for equilibrium condi- 
tions to be reached, the vertical portion of the curve would become 
almost parallel to the resistance axis. In the beta phase, the d shell is 
filled and an increase in pressure appears to cause no further marked 
change in the resistivity up to normal pressure. By comparing Fig. 10 
with Fig. 11, one sees that there is an increase in resistance as the 
hydrogen content is decreased, i.e. as the pressure over the beta phase 
is lowered from normal pressure to 17 cm Hg. Since the alpha phase 
does not form until the pressure is decreased to about 19 cm Hg, there 
can be no increase in resistance due to strains, but WRIGHT has attri- 
buted it to an increase in the number of positive holes in the d band of 
palladium. As the pressure is decreased to 17 cm Hg, the system changes 
to the alpha phase. With the removal of hydrogen the resistance drops, 
since the fall due to reduced hydrogen scattering is a more important 
factor than the removal of electrons from the d shell which tends to 
increase resistance. Under ideal conditions this decrease in resistance 
would be shown as a line parallel to the resistance axis. This seems to 
the present writer to be a reasonable explanation of the phenomena 
which have been observed. 

Some time ago ANDREWS used electrical resistance measurements 
in conjunction with pressure-volume relationships to study the reaction 
of nitrogen, oxygen and hydrogen with tantalum. Fig. 12 shows the 
variation of resistance at room temperature with gas content of the 
metal. Unfortunately, no theory was suggested to explain these curves. 
It was found that in nitrogen, absorption increased with increased gas 
pressure until about 100 relative volumes were absorbed. At this point 
the equilibrium pressure dropped as absorption increased, although this 
had no effect on the resistivity. It was noted that the resistance at 
room temperature rises less rapidly with hydrogen concentration than 
for either oxygen or nitrogen. The reason given was: “It can be 
assumed that this is due to the small atomic volume of the hydrogen, 
so that a given number of molecules dissolved in the metal would strain 
the lattice less than larger atoms.” Probably ANDREWS meant that a 
given number of atoms of hydrogen would strain the lattice less than 
oxygen or nitrogen. It is now generally accepted that the dissolved 
gas is not present as molecules. The sudden change in slope of the 
oxygen curve was attributed to the formation of an oxide compound, 
but no further explanation was given. 

Later, WEBBER, basing his experimental treatment on the results of 
ANDREWS’ work, showed that gas content (particularly oxygen and 
nitrogen since the specimens were annealed in air) is not a major 
factor affecting the superconductivity of tantalum. In this case the 
gases were absorbed from the gas phase. 

In more recent work GOLIK, LAZAREV and KHOTKEVICH have found that 
the temperature range of supraconductive transition is broadened 
as a result of electrolytically charging tantalum with hydrogen. 
The broadening increases with the amount of hydrogen in the tantalum. 
At any given temperature between 1.85° and 4.2°K, the electrical 
resistance ratio was found to increase with the amount of occluded 
hydrogen. At temperatures down to 1.86°K, it was found that 
tantalum specimens could be made to lose supraconductivity altogether 
by saturation with hydrogen. The upper limit of the transition 
range remained unchanged, all samples showing a marked decrease 
in resistivity at that point. The condition of the specimens appeared 
to control the amount of hydrogen which could be occluded. For 
example, 0.340 mg of hydrogen was required to suppress supraconduc- 
tivity in a new tantalum sample. This wire was heated 10 hours at 
1700°C in vacuo, and its supraconductivity was completely restored. 
After this anneal, only 0.005 mg of hydrogen was required to suppress 
supraconductivity. In contrast to tantalum, it was observed that 
the supraconductivity of niobium was held at 4.2°K after charging 
with hydrogen. It was concluded that this effect of hydrogen on 
tantalum could not be due to simple expansion of the lattice, but must 
be due to the formation of solid-solution alloys. 

Fig. 12. Resistance at room temperature as a function of gas content; 
resistance given in Siemens units (ANDREWS). 

DRAVNIEKS used the change in electrical resistance as a means of 
measuring the rates of reaction of zirconium and copper with both 
ionized and non-ionized gases. The study was mainly one of surface 
reactions such as oxidation. The resistance changes noted in the 
copper specimens were taken to be caused by the formation of a cuprous 
oxide film which is practically non-conducting compared with copper. 
The conductivity of the zirconium changed as the reaction with gas 
proceeded due to: (a) the formation of a zirconium-gas solid solution of 
lower conductivity than the pure zirconium, and (b) the formation of an 
oxide scale of zirconium. The change of resistivity was used merely as a 
measuring device, and again no theory was proposed to account for the 
effect. 

Recently JAFFEE and CAMPBELL have reported that both oxygen and 
nitrogen raise the resistivity of titanium to the same extent per atomic 
per cent. Hydrogen up to one atomic per cent does not seem to change 
the resistivity. The results obtained are shown in Fig. 13. 

Fig. 13. Effect of oxygen, nitrogen and hydrogen on the electrical 
resistivity of iodide titanium (JAFFEE and CAMPBELL). 

There seem to have been relatively few studies reported in the 
literature on the effect of gases on the electrical resistivity of pure 
metals other than palladium. Such work might prove to be very 
significant in showing the nature of the interaction of gases with metals. 

One might expect dissolved oxygen to lower the conductivity of pure 
copper. It will be shown later in this section that oxygen will increase 
the conductivity of slightly impure copper by removing the impurities 
from solid solution. SMART, SMITH and PHILLIPS have shown that 
there is a very small decrease in the conductivity of pure copper due to 
dissolved oxygen. Their results are shown in Table 4. 
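
A rough sense of the size of this effect can be had by averaging the Table 4 conductivities by oxygen condition. The sketch below is illustrative only: it pools the tabulated values without regard to annealing temperature or annealing atmosphere, which the original authors kept separate, and the list and function names are hypothetical.

```python
from statistics import mean

# Conductivity values (per cent) read from Table 4, pooled by oxygen condition.
# Pooling across annealing temperature and atmosphere is an assumption made
# here purely for illustration.
OXYGEN_FREE = [102.32, 102.39]
OXYGEN_BEARING = [102.23, 102.10, 102.21, 102.23, 102.19]


def mean_conductivity_loss(free, bearing):
    """Difference between the mean conductivities of the two groups."""
    return mean(free) - mean(bearing)


if __name__ == "__main__":
    loss = mean_conductivity_loss(OXYGEN_FREE, OXYGEN_BEARING)
    print(f"mean oxygen-free:    {mean(OXYGEN_FREE):.3f} per cent")
    print(f"mean oxygen-bearing: {mean(OXYGEN_BEARING):.3f} per cent")
    print(f"apparent loss:       {loss:.3f} per cent")
```

On these pooled figures the apparent loss is of the order of 0.1 to 0.2 per cent, which is comparable with the scatter among the individual samples.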

It is seen that there does seem to be a loss in conductivity due to the 
dissolved oxygen, but the relationship between the data obtained at the 
two annealing temperatures is inconclusive. 

TABLE 4 
Conductivity of Samples Annealed for One Hour at Indicated 
Temperatures and Cooled by Rapid Quenching 

Columns: Sample No.; Description; Conductivity, per cent (annealed at 
500°C; annealed at 850°C). 

Oxygen-free: 102.32 
Oxygen-bearing, saturated at 850°C: 102.23; 102.10 
Oxygen-bearing (check run): 102.21; 102.23 
D* Oxygen-free: 102.39 
E* Oxygen-bearing: 102.19 

* Measurements on these samples were made in another laboratory. 
Individual values are footnoted as annealed in nitrogen or annealed in hydrogen. 

In the course of work on internal oxidation, MEIJERING and DRUYVESTEYN 
determined the effect on the electrical resistance of annealing 
silver- and copper-base alloys in oxygen. Table 5 gives in semi-quanti- 
tative terms the results of this work. Factors such as oxidation tempera- 
ture, and time of annealing before or after oxidation affect the resistivity. 
These factors are not considered in the table so the results are semi- 
quantitative. 

TABLE 5 
Resistivity in 10⁻⁶ Ω cm at 0°C of Annealed (ρa) and Oxidized (ρo) Silver and 
Copper Alloys 

Rows: 
Silver + 0.33 per cent magnesium 
Silver + 0.15 per cent beryllium 
Silver + 0.26 per cent aluminium 
Silver + 0.60 per cent titanium 
Silver + [...] per cent zinc 
Pure silver* 
Copper + 0.75 per cent magnesium 
Copper + 0.10 per cent beryllium 
Copper + 0.69 per cent beryllium 
Copper + [...] per cent aluminium 
Copper + [...] per cent titanium 
Pure copper 

The percentages are weight per cent. The silver-zinc alloy was annealed in nitrogen 
to cut down evaporation of zinc. 
* Evidently not very pure. 

In the case of the copper alloys, the resistance after oxidation of the 
alloy approaches that of pure copper. Only the non-hardening zinc 
alloy in the silver-base group decreased to the resistivity of pure silver. 
The alloys with titanium and aluminium had a resistivity well above 
that of pure silver and those of magnesium and beryllium increased 
above the value for the annealed alloy. 

The two main factors which influence the resistance have been 
described by MEIJERING and DRUYVESTEYN as: 

“(1) Internal strains cause generally an increase of the resistance as 
the lattice is distorted irregularly by these strains. This effect causes a 
decrease of the resistance by annealing. TAMMANN and DREYER* 
found that by annealing after 98 per cent rolling the resistance of pure 
silver decreases by 5 per cent, the decrease of pure rolled copper after 
annealing being only 1.6 per cent. 

“(2) A small number of foreign ions, substituted on the normal 
lattice places, increases the resistance, this effect being for copper and 
silver alloys the larger the higher the valency of the alloying element,* 
as ions with a high electric charge will scatter the electron waves con- 
siderably. When precipitation occurs and larger particles are formed the 
resistance will decrease. It is uncertain whether the scattering of 
electron waves in a silver lattice is highest on magnesium ions or on 
magnesium oxide molecules.” 

Fig. 14. Effect of atmospheric exposure on electrical conductivity of a 
Mg-Al-Zn alloy. Such exposure results in the addition of hydrogen to the 
magnesium lattice (BUSK and BOBALEK). Abscissa: exposure. 

Thus oxidation of the alloys may be expected to involve both these 
factors to some degree. It was suggested that the lattice distortion on 
oxidation would probably increase, causing an increased resistivity, 
while the electron scattering near the solute ions probably decreases, 
tending to cause a decreased resistivity. If, on oxidation, very large 
particles of oxide are precipitated then the strain in the lattice will not 
be sufficient to increase the resistivity. If the scattering of electrons 
near the solute ions is large as in the case of trivalent or quadrivalent 
ions like aluminium and titanium, then there will be a marked tendency 
for the resistivity to decrease when the solute element is oxidized. In 
silver alloys with magnesium and beryllium in solution, the increase in 
resistance, due to increased distortion in the lattice after oxidation, 
appears to be a more effective factor than the tendency to decrease the 
resistivity by decreasing the electron scattering. In all other cases the 
effect of the decrease of electron scattering outweighs the distortion 
effect. 

* This number refers to the list of references at the end of the present paper. 

Also considering alloys, BUSK and BOBALEK determined the effect 
of hydrogen on the conductivity of magnesium alloys. The results of 
their investigation are shown in Fig. 14. The last point on the diagram 
was obtained after the specimen was saturated with hydrogen after it 
was charged for 4 hours in an alkaline solution. Prior to saturation the 
specimen was exposed to a normal atmosphere for the times shown, 
during which period hydrogen was absorbed by the lattice as indicated 
in another experiment. No suggestions were given to account for the 
curve. It was shown that the initial rise in conductivity was not due to 
the precipitation of a second phase during the initial stages of gassing. 


Miscellaneous Effects 

Other interesting effects caused by gases in various metals are discussed 
in the remainder of this section. Since each effect is a relatively isolated 
phenomenon, only a small amount of study having been devoted to 
each one, this section has been separated from the others. 

The analogy between the solid solution of gases in metals, and liquid 
solutions has been demonstrated by experiments showing that the 
electrolysis of these solid solutions is quite possible. COEHN and his 
co-workers studied the hydrogen-palladium system and SEITH and 
KUBASCHEWSKI reported a similar effect in the iron-carbon system. 
In each case the non-metallic element moves in the solid metal as a 
positive ion under the influence of the electric field. This agrees with 
the idea that hydrogen is ionized to form an electron and a positive 
hydrogen ion when it enters the metal. More recently DE BOER and 
FAST have found that under suitable conditions of temperature and 
applied potential difference, the oxygen in solution in zirconium wires 
could be made to diffuse through the metal lattice as a negative ion. 
In one experiment a wire with 5 atomic per cent of dissolved oxygen 
was heated to 1640°C by passing direct current through it. After some 
hours the temperature had dropped at the negative pole and had risen 
to a white heat at the positive end of the wire. Subsequent x-ray 
analysis indicated a concentration of 16 atomic per cent of oxygen at 
the positive end and a gradual transition to almost pure zirconium at 
the negative end. In another experiment the oxygen was driven back 
and forth at will, simply by reversing the polarity of the applied poten- 
tial. It was predicted by BARRER that oxygen would behave as a 
negative ion. 

In the same work it was shown that the lattice constants a and c 
of the hexagonal zirconium lattice both increase up to 10 atomic per 
cent of oxygen. Beyond that point only the c value increases. Measure- 
ments were made on specimens containing up to 30 atomic per cent 
oxygen. This greater increase of c is similar to the case for titanium 
reported more recently by CLARK, although CLARK measured only 
to 0.5 per cent of dissolved gas. The specific gravity of the zirconium- 
oxygen samples was measured and it was found that this increased over 
5 per cent while the volume increased less than 2 per cent after absorp- 
tion of 30 atomic per cent oxygen. A calculation of the specific gravity 
was made assuming that all the oxygen is contained in the interstices 
of the zirconium. The calculated values agreed very well with the 
experimental data. From all this information, and from previous work 
by FAST, it was concluded that all oxygen atoms are contained in solid 
solution at interstitial positions. More recently, HALL, MARTIN and 
REES have confirmed by x-ray analysis that in samples containing 
up to 36 atomic per cent oxygen, the oxygen was in true solid solution 
in the zirconium. 
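
The specific-gravity argument can be checked with a short calculation. The sketch below is not the calculation of the original authors; it assumes that "atomic per cent" is reckoned on the total number of atoms (zirconium plus oxygen), that every oxygen atom sits interstitially so that the number of zirconium atoms per unit cell is unchanged, and it uses rounded modern atomic weights. All names are hypothetical.

```python
# Interstitial-solution estimate of the density change in Zr-O (illustrative).

M_ZR = 91.22  # atomic weight of zirconium, g/mol (rounded)
M_O = 16.00   # atomic weight of oxygen, g/mol (rounded)


def oxygen_per_zr(atomic_fraction_o):
    """Oxygen atoms per zirconium atom for a given atomic fraction of oxygen."""
    return atomic_fraction_o / (1.0 - atomic_fraction_o)


def relative_density(atomic_fraction_o, relative_volume):
    """Density relative to pure Zr if all the oxygen is interstitial.

    relative_volume is the cell volume relative to pure Zr (1.02 for a
    2 per cent expansion); the zirconium count per cell is held fixed.
    """
    mass_ratio = 1.0 + oxygen_per_zr(atomic_fraction_o) * M_O / M_ZR
    return mass_ratio / relative_volume


if __name__ == "__main__":
    x = 0.30  # 30 atomic per cent oxygen
    for dv in (1.00, 1.01, 1.02):
        print(f"volume x{dv:.2f} -> density x{relative_density(x, dv):.4f}")
```

With 30 atomic per cent oxygen the added mass is about 7.5 per cent, so a volume increase of under 2 per cent leaves a density increase of more than 5 per cent, in line with the measurements described above.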





JAEGER, ROSENBOHM and VEENSTRA have reported that the 
specific heat of silver freshly melted in air, or heated in air at 940°C 
for 5 to 10 hours was higher than that of silver which had been melted 
in a vacuum, or had been stored for several months after melting in air. 
Their results showed a marked change in specific heat when gas-free 
silver was heated in air above 650°C. This effect was ascribed to the 
presence of dissolved oxygen. 

Later, KELLEY determined the specific heat of pure tantalum, and
of tantalum containing small amounts of hydrogen in solution over the 
temperature range 80°K to 295°K. The tantalum containing hydrogen 
behaved anomalously, the specific heat having a high maximum in the 
region 220° to 265°K, depending on the amount of hydrogen in solution. 
The height of the maximum and the difference in measurable heat 
content between pure tantalum and that containing hydrogen were 
found to be about proportional to the amount of hydrogen in solution. 
The temperature at which the maximum occurs increases as the 
hydrogen content increases. When specimens containing hydrogen 
were degassed the specific heat returned to the normal value for pure 
tantalum. KELLEY has postulated that differences in entropy between 
hydrogenated and gas-free tantalum may be attributed to differences in
the number of possible ways of arranging the hydrogen ions in the
tantalum lattice at the higher and lower temperatures. 





The effects of gases on the mechanical properties of metals will not 
be considered in great detail here. It is felt that much of the experi- 
mental work has been done on metals having metallic impurities which 
would make the results of the work insignificant. In some cases 
unsuspected internal oxidation may have occurred which would seriously 
distort the results of the experiment. In all cases where a second phase 
is formed (oxide or nitride) the hardening or embrittling effect is well
known and need not be discussed. The effect of hydrogen, oxygen or
nitrogen in true solution in those metals which only dissolve small
amounts of the gas does not seem to be really well known. Of those
metals which dissolve relatively large amounts of gas, titanium has been
the subject of the most intensive study. JAFFEE and CAMPBELL have
studied the effect of oxygen, nitrogen and hydrogen on the mechanical
properties of iodide-refined titanium. Fig. 15 shows a plot of the average
cross-sectional hardness of the specimens versus their gas contents. The
results indicate that nitrogen may harden titanium to a slightly greater
extent than oxygen, while hydrogen actually appears to decrease the
hardness slightly. This apparent decrease was felt to be due to hardness
variations in the original titanium rod. Fig. 16 shows that, for the range of
solubility considered, nitrogen increases the tensile strength more
than oxygen, and correspondingly decreases the ductility more. The
apparent high ductility found with 0.5 per cent of hydrogen was felt
to be due to specimen variation and not to any effect of the gas.

Fig. 15. Average cross-sectional hardness values for alloys of iodide
titanium with oxygen, nitrogen and hydrogen (JAFFEE and CAMPBELL).
Abscissa: gas content, atomic per cent.

FINLAY and SNYDER have recently studied the effect of nitrogen,
oxygen and carbon on the mechanical properties of titanium. This
work is similar to that discussed above but is more complete. In
general this more recent study has confirmed the results obtained by
JAFFEE and CAMPBELL. In contrast to the substitutional alloys, where
the solute element with the lowest solubility has the greatest strengthening
effect per unit concentration, it was found here that the interstitial
element with the lowest solubility of the three studied (carbon) has the
lowest strengthening effect. However, it was also noted that the
element (oxygen) with the highest solubility does not have the greatest
hardening effect. Also JAFFEE and CAMPBELL showed that hydrogen,
with a greater solubility in titanium than carbon, oxygen or nitrogen,
has almost no strengthening effect. There does not appear to be sufficient
information available to make a generalization at this time.
FINLAY and SNYDER have studied the relationship between the amount
of hardening and the extent of lattice-parameter increase effected by
additions of the interstitial solute elements. This relationship is shown
in Fig. 17. It was suggested that oxygen and nitrogen may behave
similarly in their interactions with the bonding electrons in the alpha
solid solution, thus accounting for the correlation between mechanical
properties and parameter changes.

Fig. 16. Tensile strength and elongation of alloys of iodide titanium with
oxygen, nitrogen and hydrogen (JAFFEE and CAMPBELL).

Fig. 17. ... containing nitrogen, oxygen and carbon (FINLAY and SNYDER).





A recent paper by WAIN and COTTRELL describes one of the most
interesting effects which gases have on metals. In this work it was 
shown that zinc crystals containing nitrogen have a sharp yield point 
when sufficiently strain-aged, while no yield point was found in nitrogen- 
free zinc. References are given in the paper to others which have shown 
that nitrogen also causes a yield point in iron, cadmium single crystals, 
and molybdenum polycrystals. OROWAN had observed a similar effect
in zinc which he termed “thermal hardening,” but he did not attribute
the yield point to the presence of a specific solute element. Unlike
cadmium crystals, the zinc crystals would not absorb sufficient nitrogen 
from the atmosphere during growth, so the nitrogen had to be intro- 
duced in a preliminary operation, by melting the zinc under a flux or by 
bubbling the gas through the zinc. It was definitely shown that the 
effect was due to nitrogen present internally and not merely as a surface 
film. Stress-strain curves, made after various ageing treatments at 
room temperature, show that the elastic limit decreases due to recovery 
and the yield point does not appear for several hours. Almost complete 
recovery precedes the return of the yield point, which appears after 
ageing for 8 or 9 hours at room temperature. Ageing for longer time 
periods causes an increase in the yield point. At higher temperatures 
the yield point appeared after shorter ageing times. From experimental
results it was deduced that the process is thermally activated, with an 
activation energy of 21,000 cal/mole. The orientation of the crystals 
was found to have no effect on the magnitude of the yield point. The 
theory presented to explain this effect was that: “. . . the yield point
is caused by the segregation of solute atoms to dislocations. The
attraction of the dislocations to the segregated atoms provides a bond
which has to be broken by a larger force than is necessary to maintain
freed dislocations in motion; the material thus gives way suddenly
and softens, at the start of plastic flow, producing a sharp yield point. 
The theory also explains the observed removal of the yield point by 
plastic overstrain and its return on strain-ageing. A freshly strained 
specimen contains freed dislocations and does not show a yield point, 
but on ageing these dislocations become anchored by the migration of 
solute atoms to them and the yield point returns.” 
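
The statement that the process is thermally activated can be illustrated by scaling the ageing time with temperature. The sketch below is not taken from the paper under review: it assumes a simple Arrhenius dependence (time inversely proportional to exp(-Q/RT)), takes room temperature as 293 K and the reference ageing time as 8.5 hours (the midpoint of the 8 to 9 hours quoted above), and uses the quoted activation energy of 21,000 cal/mole. The function name is hypothetical.

```python
import math

R_CAL = 1.987        # gas constant, cal/(mole K)
Q_CAL = 21_000.0     # activation energy quoted in the text, cal/mole


def ageing_time(t_ref_hours: float, temp_ref_k: float, temp_k: float,
                q: float = Q_CAL) -> float:
    """Ageing time at temp_k that is equivalent to t_ref_hours at temp_ref_k.

    Assumes the rate of the process varies as exp(-q / (R * T)), so the
    time needed varies as exp(+q / (R * T)).
    """
    exponent = (q / R_CAL) * (1.0 / temp_k - 1.0 / temp_ref_k)
    return t_ref_hours * math.exp(exponent)


# Assumed reference: yield point returns after about 8.5 h at 293 K.
for temp_c in (20, 50, 100):
    hours = ageing_time(8.5, 293.0, temp_c + 273.0)
    print(f"{temp_c:>3} °C: about {hours * 60.0:.1f} minutes")
```

With these assumed values a rise of 30°C shortens the ageing time from hours to a fraction of an hour, which is in the direction of the observation that the yield point reappears sooner at higher temperatures.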

It is believed that, in each of the metals in which the yield pheno- 
menon has occurred, the nitrogen is present in interstitial solution so 
that it distorts the lattice of the metal and diffuses very rapidly, condi- 
tions which must be fulfilled to agree with the theory suggested. It was 
predicted that marked yield points may not be found in face-centred 
cubic metals “because of the difficulty of anchoring screw dislocations
in this type of lattice.” 

It was previously pointed out that one consequence of the theory of 
nitrogen atoms migrating to dislocations (in alpha-iron) is that the 
electrical resistance should decrease by a small, predictable amount on 
strain-ageing. This effect was examined by COTTRELL and CHURCHMAN,
and was found to exist.


SUBSURFACE OXIDATION OR INTERNAL OXIDATION 


Although, as mentioned earlier, surface oxidation will not be discussed, 
the phenomenon of internal oxidation is considered to be within the 
scope of the present review. 

Generally, if an alloy contains a solute element with greater affinity
for oxygen than the solvent metal, the minor constituent may oxidize
below a surface oxide formation, in an area where the partial oxygen 
pressure is too low to cause oxidation of the major element. This 
subsurface oxide will not flake off as may the surface layer of oxide of 
the solvent metal. 

Although this effect was observed by TURNER as far back as 1912,
relatively little has been published until recently. Although they both
observed internal oxidation in commercial copper during annealing,
BLAZEY and later WYMAN did not consider this to be as important
as other factors causing embrittlement. C. S. SMITH is responsible
for the term “subscale” which is used to denote the layer of subsurface
oxide, and he appears to have been the first to make quantitative
observations of the phenomenon. FRÖHLICH has remarked on the
occurrence of internal oxidation of several copper alloys but did not
consider the problem in detail. LEROUX and RAUB found a precipitation
structure of both cupric and cuprous oxide in silver-copper alloys.
The cupric oxide occurred in alloys poor in copper, and the cuprous
oxide in those rich in copper. Most of the experimental work in this
field has been done with copper as the solvent metal, but MEIJERING
describes the properties of alloys of silver, copper, nickel, and iron
respectively, with small amounts of other metals, after treatments 
leading to internal oxidation. 

In one paper, RHINES has classified the modes of occurrence of
subsurface oxidation as one or a combination of three types: “(1)
general precipitation of separate particles of the oxide of the alloying
element within a matrix of the major metal, (2) deposition of relatively
thin layers of the oxide of the alloying element along the grain boundaries
or special planes of the major metal, or (3) envelopment of
particles of the major component in a matrix of the oxide of the
alloying element.” The first type, which is generally indicated when
one mentions “internal oxidation,” occurs in dilute alloys at high
temperatures. The second type occurs at lower temperatures with
greater as well as smaller amounts of the alloying metal. The third
type can only occur when there is a relatively large amount of the
oxidized metal present. In the same paper RHINES has described some
very interesting practical aspects of subsurface oxidation.

The actual mechanism of internal oxidation seems to be quite simple. 
As prerequisites for internal oxidation to occur, RHINES has indicated
that there must be a solubility of oxygen in the alloy and the oxygen 
must be able to diffuse fairly rapidly into the alloy. The minor con- 
stituent must form an oxide which is more stable than the oxide of the 
major component, and which is insoluble in the alloy. If one considers 
the system as a ternary alloy, then the constitution of the system must 
allow a quasibinary equilibrium to occur between the solid solvent and 
the oxide particles at the oxidation temperature. 

Fig. 18 shows typical internal oxidation. At the bottom of the 
picture one may see a relatively thin dark layer which probably consists 
mainly of cuprous oxide in this case, with an oxide of manganese. This 
formation is not actually described by RHINES, but would be analogous
to one of a copper-silicon alloy described in an earlier paper. The 
second, and larger, region consists of copper-manganese solid solution 
plus a precipitate of a manganese oxide. It is this layer which is the 
area of internal oxidation. Finally, the region at the top of the photo- 
graph is simply a copper-manganese solid solution. By oxidizing at a 
low oxygen pressure a subscale may be formed without oxidizing any 
of the solvent metal, thus leaving an apparently clean surface even 
though a considerable depth of oxidation has occurred. This effect is 
shown in Fig. 19. 

To explain the mechanism of internal oxidation RHINES has used
the diagram shown in Fig. 20. 

Fig. 18. Typical internal oxidation. Alloy of copper with 0.4 per cent
manganese oxidized at 1000°C in air for 8 hours. × 50 (RHINES).

Fig. 19. Internal oxidation in alloy of copper with 0.103 per cent
silicon. × 50. Sample oxidized at low oxygen pressure has no
external scale. Internal oxidation is marked by zone crossing lower
half of picture (RHINES).

Fig. 20. Schematic representation of distribution of dissolved oxygen
and alloying element A in subscale and unoxidized solid solution
(RHINES).

As has been noted earlier in this paper, there is a considerable
solubility of oxygen in copper. Thus it is possible for the diffusing
oxygen in the metallic lattice to come into contact with the alloying
element. If the oxide of the alloying element is more stable than
cuprous oxide, under the conditions of temperature and oxygen pressure
imposed, then the solute metal oxide will form. This preferential
oxidation of the alloying element may take place beneath the external
oxide scale or, if the external oxygen pressure is not sufficient to form a
scale, it may extend to the apparently clean surface (Fig. 19). Once the
solute metal oxide has formed it will not remain in solution very long
but will separate from the lattice as a precipitate. As the solute metal
is exhausted by this precipitation, the oxygen supplied to the surface
of the subscale (either from the external atmosphere or from the copper
oxide layer) diffuses farther into the alloy and thus produces a zone of
ever-increasing thickness. At the same time, the solute metal component
must diffuse in the opposite direction toward the advancing
front of scale to replace the decreasing concentration of solute atoms
being precipitated as oxide particles. Thus RHINES points out that
the rate of subscale growth depends, among other conditions, upon the
diffusion rates of both oxygen and the reacting solute element. In
confirmation of this theory, it has been found that in many alloy systems, the
square of the subscale depth is a linear function of the time of reaction
at constant temperature. It was shown in another paper that, if
melting does not occur, internal oxidation of copper alloys occurs
between 750° and 1000°C as a relatively simple diffusion process involving
the movement of gas and metal atoms. This process was described
by mathematical expressions. The oxidation process investigated
at 600°C was found to be more complex. DARKEN has also
discussed the mechanism of subscale formation. He has shown how the
rate of growth and composition of the zone may be predicted from a
knowledge of diffusion constants, phase relations, and boundary
conditions.
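
The parabolic relation mentioned above (square of subscale depth proportional to time at constant temperature) can be put into a few lines of code. This is only an illustrative sketch: the depth and time values are hypothetical and are not measurements from any of the papers cited, and the function names are invented for the example. The rate constant k is obtained by a least-squares fit of depth² = k·t through the origin.

```python
import math


def parabolic_constant(times, depths):
    """Least-squares estimate of k in depth**2 = k * time (line through origin)."""
    numerator = sum(t * d * d for t, d in zip(times, depths))
    denominator = sum(t * t for t in times)
    return numerator / denominator


def depth_at(time, k):
    """Subscale depth predicted after a given time."""
    return math.sqrt(k * time)


def time_for(depth, k):
    """Time needed to reach a given subscale depth."""
    return depth * depth / k


# Hypothetical measurements at one temperature (hours, millimetres).
times_h = [1.0, 4.0, 9.0, 16.0]
depths_mm = [0.10, 0.20, 0.30, 0.40]

k = parabolic_constant(times_h, depths_mm)
print(f"k = {k:.4f} mm^2 per hour")
print(f"depth after 25 h: {depth_at(25.0, k):.2f} mm")
print(f"time to reach 0.8 mm: {time_for(0.8, k):.0f} h")
```

A consequence of the square-law form is that doubling the required depth quadruples the time, whatever the value of k. The dependence of k on temperature, oxygen pressure and solute content is not modelled here.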

RHINES found that the particle size of the precipitate depends upon
the chemical nature and concentration of the solute element and the 
temperature of oxidation. The more stable oxides tend to form fine 
precipitates, while the less stable ones precipitate as coarse particles. 
High temperature and high concentrations of the alloying element 
both promote large particle size. Usually the oxide particles are well 
distributed throughout the zone of oxidation. 

The precipitate distribution depends on the system and the tempera- 
ture of oxidation. In some cases the density of oxide particles is greater 
near the metal surface, in other cases the reverse is true, depending on 
the relative diffusion rates. As the temperature of oxidation is lowered 
the oxide particles tend more to occur at the grain boundaries rather 
than in the uniform distribution characteristic of high temperature 
oxidation. 

RHINES has also found the rate of formation of the zone of oxidation
to decrease with time, with an increased content of the solute element, 
with lower temperatures and with decreased oxygen pressure. The 
time to form a given thickness of subscale increases almost directly 
with increase in the concentration of the solute metal. 

In ternary alloys, where the two solute elements form oxides of 
different solubilities, double zones of internal oxidation occur. The 
outer zone contains the oxides of both alloying elements, while the 
inner zone contains only the more stable oxide of the two. By altering 
the alloy composition one may change the dimensions of the subscale. 
By changing the oxygen pressure selective oxidation of the solute 
element forming the most stable oxide may be promoted. 

In 1947 RHINES summarized his work, and in the review mentioned
that the surface hardening of alloys due to the presence of uniformly
distributed oxide precipitate does not seem to be possible. However, in
1945, CHASTON had already noted what he called “precipitation-hardening
effects” and used associated properties to develop his theory
of water vapour embrittlement of silver (see section on Embrittlement).
CHASTON was able, by careful oxidation, to obtain a marked increase
in the Vickers hardness of silver alloys with small amounts of aluminium
or zinc in solution. It was noted that there is an optimum solute content
required to give the maximum hardening effect for a given oxidation
treatment. Larger amounts of the solute in solution appear to
hinder the diffusion of oxygen into the subsurface layers of metal. It 
was found that the depth of the oxide subscale was greater in alloys of 
low solute element content, and the depth could be increased by 
increasing the oxidation time. Commercially pure silver could be
hardened by heating in air followed by heating in an inert atmosphere
containing zinc vapour. The zinc vapour apparently diffused into the
silver to combine with the dissolved oxygen, thus causing a hardening
effect. Spectroscopically pure silver, when solid, could not be hardened
in this way, since it will not, according to CHASTON, dissolve oxygen
(see section on Embrittlement). Similar hardening was obtained by
heating silver in air containing zinc vapour. In this case the zone of
internal oxidation formed by heating silver for 4 hours in air containing
zinc vapour was found to extend 0.04 in. below the metal surface.
Shortly after RHINES' review was published, MEIJERING and
DRUYVESTEYN¹⁶⁵,¹⁷¹ reported finding substantial surface hardening
caused by internal oxidation in certain binary alloys. This hardening
effect has recently been confirmed by SMITH and DEWHURST.

In their paper MEIJERING and DRUYVESTEYN point out that “dispersion
hardening” by blocking the glide planes with minute particles is
effectively carried out by internal oxidation. By heating strips of silver
alloys for 2 hours in air at 800°C they obtained the following results:

Silver + 0.3 weight per cent magnesium → 170 Vickers (kg/mm², load: 2½ kg)
Silver + 0.4 weight per cent aluminium → 160 Vickers
Silver + 0.2 weight per cent beryllium → 135 Vickers


Similar strips annealed in nitrogen or hydrogen had hardnesses of from 
30 to 50 Vickers. 

The success of a hardening operation depends on the difference in 
stability of the oxides of the two components of the metal alloy. 
Experimental results have shown that although the oxidation of a given 
percentage of a solute element will increase the hardness of silver, the 
same treatment may not harden copper to the same extent. Similarly 
an oxidation treatment which hardens a copper-base alloy may not 
harden a nickel-base alloy due to the greater stability of nickel oxide. 
The difference between hardening and non-hardening alloys has been 
explained on a thermodynamical basis by MEIJERING and DRUYVESTEYN. 

With respect to the precipitation of the oxide particles, on which 
depends the resultant hardening effect, MEIJERING and DRUYVESTEYN 
have shown that at very high magnifications (1200 ×) no particles were
visible. The oxide will be first formed in a molecular state, the molecule
being immobile in the alloy lattice. After some time period this
molecule may dissociate (the rate decreasing with increase in stability
of the oxide) and the atoms of the oxide will move to “form greater
(and, therefore, more stable) complexes.” The less stable oxides produce
coarser particles which have practically no hardening effect. 

X-ray measurements showed that the lattice spacing of silver containing
0.36 per cent magnesium, and silver containing 0.26 per cent
aluminium increases considerably on oxidation to a value larger than
that of either pure silver or of the unoxidized alloy. The influence of
dissolved oxygen in the silver is quite small. This result was interpreted 
to indicate that a considerable part of the magnesium and aluminium 
oxides are dissolved in almost molecular dispersion in the silver lattice. 
Those alloys of silver with zinc and cadmium which did not harden had 
a spacing about the same as that of pure silver, which indicated the 
formation of larger oxide particles. Copper alloys which have been 
hardened retain a lattice spacing almost equal to that of pure copper. 

Electrical resistance measurements made by MEIJERING and DRUYVESTEYN
show that the resistivity of the oxidized copper alloys decreases
to a value approaching that of pure copper. In the silver alloys, the 
hardened specimens had a resistivity higher than pure silver (in two 
cases, with magnesium and beryllium, the resistivity increased on 
oxidation) and only the non-hardenable zinc alloy decreased to the 
value of pure silver. This high resistivity of the hardened metals 
has been assigned to two causes: (a) the silver lattice may be distorted,
and (b) the electric field near a foreign ion or atom may differ from that
near a silver ion. On oxidation the distortion effect is increased by 
the oxide molecules on the lattice. In cases other than magnesium and 
beryllium cited above, the slight decrease in resistivity is caused by 
annealing which may in some cases be larger than the effect caused by 
the oxide particles. 

Generally it was found that serious embrittlement is caused in poly- 
crystalline specimens which have been hardened to about 140 Vickers 
or more. The elongation in tension is often reduced to 1 or 2 per cent. 
However, since the brittleness was of an intercrystalline nature, single 
crystals hardened by oxidation remained quite ductile (28 per cent and 
21 per cent elongation). 

In defence of his failure to recognize the hardening effect, RHINES
attributes the cause to the fact that he used a “micro-character scratch
hardness tester” while other investigators used micro-indentation
measurements. From this he concludes that there is no matrix hardening
but an “aggregate hardening effect, which is less likely to be reported
by a scratch test.” This conclusion is quite at variance with the x-ray
measurements made by MEIJERING and DRUYVESTEYN which
indicated a lattice expansion of the alloys which had been hardened.

On the basis of unpublished work done by THOMAS, RHINES
states that: “It has been supposed, largely upon the basis of studies
by FRÖHLICH,* that internal oxidation is not to be expected in alloys
in which the minor element is noble with respect to the major element.
An exception to this has been reported recently by THOMAS,* who
studied the oxidation behaviour of copper-palladium and copper-platinum
alloys. Subscales were found in alloys containing as little as
5 per cent of the noble metal. The oxide of the subscale was, in these
cases, composed of cuprous oxide containing the noble metal in solid
solution.” From the brief description given by RHINES it does not seem
as if this phenomenon is one of internal oxidation in the usual sense of
the term. It has been usually accepted that the minor component of
a binary alloy must behave according to the requirements set out earlier
in this section, i.e. it must oxidize preferentially to the major constituent
etc. This is not the case here, where the major component is oxidized.
It has not been made clear by RHINES how the noble metals are in any
way responsible for this “internal oxidation.” This phenomenon may
be some secondary effect due to the presence of the noble metals, but
they are not directly responsible for the “internal oxidation.”

* This number refers to the list of references at the end of the present paper.


GAS EVOLUTION ON SOLIDIFICATION OF METALS


Although this review is not concerned with gases in molten metals, it 
may be well to consider briefly the effect which gases, evolved during 
solidification of metals, have on the resultant solid metals. The 
evolution may be caused simply by the fact that molten metal at the 
freezing point will absorb much larger quantities of gas than the same 
solid metal at the melting point. On solidification of the metal, the 
excess of gas comes out of solution, forms bubbles, and is trapped as 
minute cavities or blowholes in the casting. Hydrogen generally 
appears to be the major offender in causing defects of this type, parti- 
cularly in iron, copper, nickel, and aluminium. Oxygen in silver causes 
a similar effect, known as “sprouting.” 

If the metal contains dissolved oxygen as well as hydrogen, then on
solidification of the metal, these gases may react under certain conditions
to form steam. This “reaction unsoundness,” as it is called, has been
studied by ALLEN and his co-workers. In a very detailed study,
ALLEN described the porosity due to the simple evolution of hydrogen
from copper, and went on to say that when oxygen was present as well
as hydrogen the voids were smaller, more numerous, and occurred in
the crystal boundaries. The amounts of oxygen and hydrogen required
to cause this unsoundness were very small, of the order of 0.002 per cent
and 0.0001 per cent by weight respectively. ALLEN concluded also
that: “The phenomenon results from a reaction, taking place during the
process of freezing, between cuprous oxide and minute residual traces of
hydrogen, whereby steam is generated. ... Cuprous oxide and
hydrogen can exist together in molten copper.” Sulphur would have
the same effect as hydrogen in reducing the oxide of copper. ALLEN
suggested that the “reaction unsoundness” phenomenon is more
important commercially than simple gas evolution. In a later paper
ALLEN and STREET described similar effects in copper-nickel alloys.
It was concluded that unsoundness could be caused by an amount of
hydrogen below that needed for pure hydrogen unsoundness provided
that the oxygen content was greater than about 0.001 per cent for alloys
containing less than 25 per cent nickel, and about 0.01 per cent for
alloys of more than 30 per cent nickel. This critical oxygen content is
related to the solid solubility of oxygen in the alloys. ALLEN also
studied the effect of pressure on the evolution of gases from metals,
using the silver-oxygen system as an example.
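
The relative magnitudes of the oxygen and hydrogen contents quoted above for copper may be compared with the proportions in which the two elements combine to form steam. The following arithmetic is an illustration added for the reader and is not part of ALLEN's analysis; it assumes only the stoichiometry of water (two hydrogen atoms per oxygen atom) and standard atomic weights, and the function name is hypothetical.

```python
# Standard atomic weights (assumed values).
ATOMIC_WEIGHT_H = 1.008
ATOMIC_WEIGHT_O = 16.00


def oxygen_consumed_by_hydrogen(hydrogen_wt_pct: float) -> float:
    """Weight per cent of oxygen that combines with the given weight
    per cent of hydrogen to form H2O (two H atoms per O atom)."""
    return hydrogen_wt_pct * ATOMIC_WEIGHT_O / (2.0 * ATOMIC_WEIGHT_H)


hydrogen_pct = 0.0001   # order of magnitude quoted in the text
oxygen_pct = 0.002      # order of magnitude quoted in the text

oxygen_needed = oxygen_consumed_by_hydrogen(hydrogen_pct)
print(f"oxygen needed to react with all the hydrogen: {oxygen_needed:.4f} wt per cent")
print(f"oxygen present is in excess: {oxygen_pct > oxygen_needed}")
```

On this simple basis the quoted oxygen content is more than enough to combine with all of the quoted hydrogen, so that at these levels the hydrogen, rather than the oxygen, would limit the quantity of steam that can be formed.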

It was stated in the introduction to the present paper that there would 
be no discussion on degassing. However, for the reader who may be 
concerned with this problem, there is a section in the supplementary 
reference list which gives a few of the many papers which have been 
written on porosity, unsoundness and degassing. The very brief 
description of the problem which has been given here is not intended 
to be a review of the latest work as much as it is an introduction to a 
commercially important aspect of the gas-metal problem. To the 
person interested in physical metallurgy there may not be as much 
interest in this problem as to the foundryman. At any rate further 
details may be found in the papers referred to in the supplementary 
list. Since many of the papers discuss both porosity and degassing, no 
attempt has been made here to indicate which papers deal with any 
particular study. 


EMBRITTLEMENT OF METALS BY GASES 


The phenomenon of embrittlement of metals by gases has long been a
problem in the metal industry and seems to recur as an unsolved
problem at various times. Here the mechanism of gas embrittlement will
be discussed, and the causes mentioned. The remedies will not be
considered here, but may be found in some of the papers which are
referred to below.

At the outset, the writer wishes to emphasize a point which was
mentioned by MARTIN and PARKER. The term “hydrogen embrittlement,”
which is correctly used in referring to iron and steel, for
example, becomes a misnomer when it refers to the common embrittlement
of copper and silver. This latter phenomenon will be referred to
here as “water vapour embrittlement” in recognition of the fact that 
both hydrogen and oxygen are necessary to cause embrittlement in 
solid copper. Water vapour embrittlement will be discussed first, and 
true hydrogen embrittlement will be considered later. 

In a series of papers, WYMAN, in discussing copper embrittlement,
has shown that alternately annealing in oxidizing and reducing 
atmospheres causes water vapour embrittlement. It was thought that 
during the oxidizing cycle, oxygen diffuses into the metal and forms 
cuprous oxide. It will be shown later that compound formation is not 
at all necessary for embrittlement. During the reducing anneal,
hydrogen combines with the oxygen to form water vapour which
ruptures the grain boundaries.

WYMAN's investigations led to some rather important conclusions
concerning commercial annealing practice:

“1. Neither pure copper when oxidized or deoxidized nor tough-pitch
copper up to 0.06 per cent oxygen is embrittled by hydrogen at
temperatures of 400°C or below, at four hours.

“2. (a) The rate of oxygen penetration into ‘pure’ copper decreases
markedly below 800°C.

“(b) The rate of hydrogen penetration into oxygen-bearing copper
decreases markedly below 700°C. Consequently, up to about 800°C, the
coppers containing no oxygen are superior to tough-pitch coppers for
resistance to the embrittling action of oxidation-reduction cycles. . . .

“5. The elimination of oxygen by purifying process or by means of
deoxidation of the copper is advantageous in increasing the resistance
to the embrittling action of reducing gases.”

In the final paper of WYMAN's series it is shown that a steam atmosphere
alone, even for periods of 8 hours at temperatures of about 1000°C,
is not sufficient to embrittle the copper. Any iron in the furnace will 
reduce the water vapour and make hydrogen available for the embrittle- 
ment of the copper, but this action is only important at temperatures 
above 700°C. It was concluded that (with oxidized copper) hydrogen 
will cause embrittlement even at 500°C if there is sufficient gas. 
WYMAN feels that the sources of gas are dirt or grease on the copper,
volatile products from enamel, dirt, grease, lubricants, or other materials 
carried into the furnace extraneously. 

RHINES and ANDERSON have definitely established that water
vapour embrittlement is caused in copper containing oxygen, either in
solid solution, or as precipitated cuprous oxide, when hydrogen is
present in the annealing atmosphere. The embrittlement was found to
be more severe when cuprous oxide particles were present. RANSLEY
had shown that pure dry carbon monoxide will not diffuse into copper,
and removes oxygen from the copper without causing embrittlement.
This was confirmed by RHINES and ANDERSON who found that copper,
annealed in carbon monoxide, was not embrittled on subsequent
heating in hydrogen. In this same paper it was shown that hydrogen
could reduce foreign metal-oxides, present as impurities in the copper,
thus causing embrittlement. The oxides which, when reduced, caused
the most severe embrittlement were those of indium, manganese,
nickel, phosphorus, tin, and zinc. The oxides of columbium, gallium,
lithium, magnesium, and zirconium were the least detrimental of those
investigated, and the oxides of aluminium, arsenic, beryllium, bismuth,
boron, calcium, cerium, cobalt, chromium, iron, lead, antimony, and
titanium had an intermediate effect. Associated with RHINES' studies
of internal oxidation (q.v.), a minor type of embrittlement has been
found to occur when the oxides of the subscale were reduced by the
hydrogen. Another type of embrittlement, not mentioned above, was
also found when copper, containing small amounts of arsenic, antimony,
or bismuth, was annealed in hydrogen. This effect was thought to be
due to the formation of the volatile hydrides of these metals creating a
pressure sufficient to rupture the alloy.

Many separate reports are to be found in the literature on the 
embrittling action of water vapour which is usually produced within the 
metal on annealing in a hydrogen-bearing atmosphere. Most of the 
investigations have been purely empirical and in many cases the results 
are applicable only to the particular conditions under which the work 
was done. In many cases rather complex alloy systems have been used, 
so that there were several variables to be considered. This, of course, 
often leads to invalid assumptions and erroneous conclusions. The 
present writer would prefer to review just a few of the outstanding 
papers, including those discussed above. Although the following 
remarks apply particularly to copper and silver, it must be recognized 
that similar phenomena must be expected wherever one is treating a 
metal which will dissolve both oxygen and hydrogen in suitable 
quantities. 

In 1926, SMITH and HAYWARD presented evidence which was 
interpreted as showing that a sintering action occurs in copper when 
heated above 800°C after having been embrittled by a hydrogen treatment. 
It was observed that cast copper containing less than 0.06 per 
cent oxygen could be deoxidized in a relatively short time by annealing 
at a high temperature in hydrogen. However, with copper containing 
larger amounts of oxygen (0.10 per cent), it was noted that, after 
annealing at very high temperatures, the copper had much better 
electrical and mechanical properties than similar copper annealed at 
only 800°C. From metallographic and other evidence it was suggested 
that at the lower annealing temperature, the hydrogen diffuses into the 
copper and reacts with oxide particles forming steam, which blows the 
metal apart. This initial rupture will allow more rapid penetration of 
the hydrogen, and the copper oxide eutectic areas rapidly become 
spongy networks of microscopic cracks. In the case of high-oxygen 
copper, it is claimed that the stress created by the steam formation 
strains the metal which, on heating, recrystallizes and during this 
process the cracks are healed. With low-oxygen copper, the oxide 
particles form in globules along the grain boundaries. It is suggested 
that the hydrogen can reduce this oxide with little effect on the copper 
lattice. Since no strain is present, the copper will not recrystallize or 
sinter, so the metal remains brittle. 

An excellent paper by RANSLEY indicates that the mechanism of 
embrittlement suggested above is unlikely. From data available at that 
time, RANSLEY drew the following conclusions: 

“(a) Compound gases do not diffuse through solid copper. Dissociation 
of molecules adsorbed at the surface may, however, be followed by 
penetration of the constituent atoms. 

“(b) The embrittlement of copper in reducing atmospheres may in all 
instances be ascribed to penetration by hydrogen. When embrittlement 
occurs in gases such as carbon monoxide, hydrogen is present as an 
impurity, or is produced either by dissociation at the surface, or by a 
side reaction, e.g. CO + H₂O = CO₂ + H₂. 

“(c) Reaction between gases in solution with the formation of a new 
phase cannot occur in the lattice. Reaction can only occur at an interface, 
i.e. in the present case at the copper surface, at an oxide inclusion, 
or possibly at a grain boundary.” 

Fig. 21. Removal of oxygen from copper at 850°C in hydrogen at 
pressures of 760, 10.0, 5.0 and 2.0 mm. Broken line indicates 
theoretical diffusion of oxygen (RANSLEY). 

Since cracking does not necessarily originate at oxide particles, and 
since the effect always seems to be intergranular, it was suggested that 
the hydrogen and oxygen can exist in equilibrium in the lattice as 
separate atoms, but they diffuse to the grain boundaries and there 
react to form water vapour which ruptures the metal. 

On the basis of results published by PILLING, RANSLEY has suggested 
a mechanism which depends also on the diffusion rate of oxygen in the 
copper, a point neglected by SMITH and HAYWARD. At low hydrogen 
pressures (less than 26 mm Hg), it was noted that the rate of deoxida- 
tion of copper at 700°C in hydrogen depended on the rate of diffusion of 
oxygen in the copper, and not on the hydrogen pressure. This meant 
that the deoxidation was a surface process. Further experiments were 
performed by RANSLEY, who deoxidized copper at 850°C with hydrogen 
at various pressures. The curves of oxygen loss against time are shown 
in Fig. 21. At 760 mm pressure the metal was rapidly deoxidized and 
became very brittle. At lower pressures there was no significant change 
in the specimens until the conditions marked A₁, A₂ and A₃ were 
reached. At these points the specimens suddenly developed blisters. 
This blistering was attributed to the formation of water vapour beneath 
the ductile, deoxidized skin of the metal. RANSLEY has pointed out 
that blistering only took place when the solid deoxidation curves 
crossed the broken curve, which indicates the normal rate of diffusion 
of oxygen to the surface at 850°C. These curves indicated to RANSLEY 
that, at a given hydrogen pressure, as long as the supply of oxygen 
available at the surface was sufficient to react with the hydrogen, then 
there would be no embrittlement. As the deoxidized layer becomes 
progressively thicker, however, there is a progressive decrease in oxygen 
diffusion, and finally the hydrogen will be able to enter the metal and 
cause water vapour formation and consequent rupture of the metal. 
RANSLEY concluded that: “The removal of oxygen from copper in 
hydrogen will thus be a function of the oxygen content of the copper, 
the hydrogen pressure, the temperature, and the time of heating.” 
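
The balance of rates described above can be illustrated with a rough sketch. This is not RANSLEY’s own treatment: it assumes steady-state diffusion (Fick’s first law) of oxygen across a fully deoxidized layer, with the oxygen concentration falling linearly to zero at the surface, and it lumps the rate at which hydrogen can consume oxygen at the surface into one number that is taken to rise with hydrogen pressure. The function names and all numerical values are hypothetical and dimensionless.

```python
def oxygen_flux(diffusivity, bulk_conc, layer_thickness):
    """Oxygen arriving at the surface per unit area and time.

    Assumes a linear concentration drop from bulk_conc to zero
    across a deoxidized layer of the given thickness.
    """
    return diffusivity * bulk_conc / layer_thickness


def critical_thickness(diffusivity, bulk_conc, hydrogen_demand):
    """Layer thickness at which oxygen supply just matches hydrogen demand.

    For thicker layers the surface reaction can no longer use up the
    hydrogen, which is then free to enter the metal.
    """
    return diffusivity * bulk_conc / hydrogen_demand


D, c0 = 1.0, 1.0  # hypothetical, dimensionless illustration values
for demand in (0.5, 1.0, 2.0):  # larger demand stands for higher pressure
    x_crit = critical_thickness(D, c0, demand)
    print(f"hydrogen demand {demand}: critical layer thickness {x_crit}")
```

In this picture a larger hydrogen demand gives a thinner critical layer, so that penetration begins sooner. That is at least qualitatively in line with the immediate embrittlement at 760 mm and the delayed blistering at lower pressures.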


As was stated earlier, RANSLEY’s work has also definitely shown that 
copper may be completely deoxidized, without deleterious effects on 
the mechanical properties, by annealing in carbon monoxide free from 
hydrogen. Carbon monoxide does not diffuse into solid copper. 

MARTIN and PARKER found that silver, containing dissolved 
oxygen, suffered a decline in strength and ductility when heated in 
hydrogen. This effect is analogous to that in copper noted by RHINES 
and ANDERSON, who found that water vapour embrittlement could 
occur without the presence of cuprous oxide particles. MARTIN and 
PARKER also found that, due to the rapid absorption of oxygen by 
silver, it is possible to embrittle silver by first annealing it in hydrogen 
and then in air. It was also found that when the silver specimens were 
annealed in air, a fine-grained structure was present near the surface 
but larger grains formed in the body of the metal. This grain-growth 
inhibition was apparently due to some property of the surface material 
only. When a layer of the surface metal was machined off the specimen 
prior to annealing, it was found that the fine grain structure did not 
form. It was concluded that the original specimen was not homogeneous, 
but no further tests were made. As was expected, specimens of gold 
and platinum showed no embrittlement on annealing in hydrogen 
after heating in air. The embrittlement of silver by heating in hydrogen 
did not occur below 500°C. 

CHASTON studied the inhibition of grain growth mentioned above, 
and also noted by RHINES and GROBE. In the experimental work on 
commercially pure silver (99.98-99.99 per cent silver) CHASTON observed 
that annealing in an oxygen-free atmosphere at temperatures 
above 600°C caused rapid growth, while the same treatment in air 
caused impeded grain growth in the areas to which oxygen had diffused. 
By heating this silver in air at relatively low temperatures (500°C) 
where grain growth is slow even in vacuo, enough oxygen may diffuse 
into the metal to prevent any grain growth even at 800°C. On the 
other hand, it was also observed that grain growth occurred to an equal 
degree in spectroscopically pure silver when it was annealed at 600-800°C 
in air as when annealed in hydrogen. It was found that an 
addition of 0.01 per cent copper, iron or lead to this silver restored the 
grain growth inhibition. CHASTON attributed the grain growth inhibition 
to the particles of the metallic impurity oxides formed by internal 
oxidation. 

The results of the work of MARTIN and PARKER were not in agreement 
with SMITHELLS, who claimed that silver saturated with oxygen 
could not be embrittled by hydrogen “because oxygen diffuses so 
rapidly out of silver that it has all escaped before the hydrogen can 
diffuse in.” Actually embrittlement by oxygen diffusion into hydrogen- 
saturated silver was found to be a minor effect, contrary to the opinion 
of SMITHELLS, who felt that the oxygen would diffuse in much faster 
than the hydrogen would diffuse out. However, CHASTON confirmed 
the possibility of embrittling silver by heating first in oxygen and then 
in hydrogen, the reaction between the gases occurring at temperatures 
above about 300°C. By various experiments it was learned that the 
depth and severity of embrittlement was controlled by the treatment 
in oxygen, and that the variations in the hydrogen treatment had no 
effect, provided the metal was above 400°C. 

CHASTON was not able to confirm the observation by MARTIN and 
PARKER that oxygen-free silver, annealed first in hydrogen, could be 
embrittled by heating in oxygen. CHASTON believes that the effect 
noted by MARTIN and PARKER was due to small amounts of oxygen 
retained in the silver prior to the hydrogen anneal. CHASTON was not 
able to produce embrittlement in silver with less than about 0.01 per 
cent of total impurities. The exact limiting amount of impurity which 
will restrain grain growth or promote embrittlement was not deter- 
mined. Another important observation was that these effects are not 
affected by the rate at which silver is cooled after annealing in air. 
CHASTON has claimed that this last observation is an argument against 
the possibility that dissolved oxygen may be an important factor in the 
embrittlement reaction. For this, and other reasons, a new mechanism 
of embrittlement was proposed involving the concept of a subscale 
formation of impurity metal oxides in the silver. CHASTON suggests 
that this finely dispersed phase would (a) hinder grain growth, (b) react 
with hydrogen to embrittle the silver, and (c) harden the metal (see 
section on Internal Oxidation). It was considered that such a formation 
would account for all the observations. One weakness in the idea, 
suggested by CHASTON himself, is that the oxygen may be removed 
from the silver by a vacuum treatment below the melting point. One 
would not expect this to occur if relatively stable oxides had been 
formed. However, it was claimed that: “It is not inconceivable, 
however, that in these conditions the very finely divided oxides distributed 
through silver may be capable of reduction to the metallic state.” 

Another effect, not strictly embrittlement, has been studied by 
EBORALL and RANSLEY. When an aluminium-magnesium alloy 
containing 7 per cent of magnesium is heated with water vapour in the 
temperature range between the solidus and the liquidus, the metal 
blisters and blackens. The blistering is due to the absorption of 
hydrogen which is released at surfaces in the interior of the metal 
when the concentration of hydrogen in the metal has reached a certain 
value. At lower temperatures (400°C) it was thought improbable that 
the hydrogen alone would cause blistering. However, if severe flaws 
were present in the metal, they could be enlarged considerably by 
annealing in a moist atmosphere. 

ZAPFFE has considered the above type of problem and has suggested 
the concept of the “hydrogen potential.” A hypothetical system is 
considered to consist of a closed chamber divided into two compartments 
by a diaphragm of the metal being studied. Steam, at a constant 
partial pressure of 1 atm, is admitted to one compartment. If the metal 
membrane is initially saturated with oxygen, the steam will react 
with it to form metal oxide and hydrogen. Some of the atomic hydrogen 
will diffuse through the membrane forming molecular hydrogen in the 
second compartment, some will remain in solution in the metal, and 
some will form molecular hydrogen in the steam. The limiting equilibrium 
hydrogen pressure which is equal in both compartments is called the 
“hydrogen potential,” and is a measure of the hydrogen absorbed by 
the metal. The case is also considered where the membrane is not 
initially saturated with oxygen. It is shown from this concept, for 
example, that nickel has a very low “hydrogen potential” which is 
related to the fact that nickel does not react with steam. The “potentials” 
of aluminium and magnesium, which react with steam, are much 
higher. The measure of hydrogen potential is related to the tendency of 
the metal to react with water vapour and, therefore, to form blisters. 





Several years ago PFEIL, when experimenting with samples of 
almost pure iron containing crystals of about 1 in. in diameter, found 
that when such a specimen had been pickled for 30 minutes in 20 per 
cent sulphuric acid, the crystals could be broken away from each other 
at the crystal boundaries. Apparently the metal was so brittle at the 
boundaries that fracture occurred without any marked deformation in 
the body of the individual crystals. Since that time, many experiments 
have been done to study the detrimental effects of hydrogen in iron and 
steel—particularly embrittlement and hair-line crack formation— 
caused by prolonged soaking in hydrogen atmospheres, pickling, or 
electrolytic plating. A survey of the extensive literature has shown 
that little is known concerning the fundamental factors involved in 
these problems. Many of the papers are purely descriptive and deal 
with relatively complex systems which are not suited to a study of the 
problem. The writer appreciates the importance to industry of knowing 
how to overcome these effects in certain special cases; however, one 
feels that it is more important to answer the fundamental “Why?” 
Fortunately, in more recent years, there have been a few studies 
devoted to answering the question. 

Some of ANDREW’s earlier papers dealt with the effect on mechanical 
properties and hair-line crack formation of hydrogen introduced into 
laboratory specimens of steels. Later, SYKES, BURTON and GEGG 
attempted to interpret this work in terms of industrial practice. The 
hydrogen contents of a number of steels were determined at different 
stages of the steel-making process. It was confirmed that a hydrogen 
content greater than 2 c.c. per 100 g would reduce the steel’s ductility. 
This is significant since it was also found that all the steels produced 
contained 4-6 c.c. of hydrogen per 100 g. Depending on the thermal 
history of the specimen, high hydrogen contents may or may not cause 
hair-line cracks. Although other factors may modify the phenomenon, 
it was definitely concluded that steel with a high hydrogen content is 
more liable to crack during heating and cooling than hydrogen-free 
steel. It was not suggested that hydrogen alone is the cause of crack 
formation. In fact, in slowly cooled material, free from macro-stresses, 
the hair-line cracks were attributed to the combined effects of hydrogen, 
local stresses, and low ductility in a preferred direction. The reader is 
referred to the paper for complete details of this interesting work, and 
for the puzzling problems which had to be left unanswered. 
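
The hydrogen contents quoted above in c.c. per 100 g can be put on a weight basis with a short calculation. The sketch below assumes, which the text does not state, that the gas volumes are reduced to 0°C and 1 atm, and it uses the ideal-gas molar volume; the function name is purely illustrative.

```python
MOLAR_VOLUME_STP_CM3 = 22414.0  # cm^3 of ideal gas per mole at 0 degC and 1 atm
H2_MOLAR_MASS_G = 2.016         # grams per mole of molecular hydrogen


def cc_per_100g_to_ppm(cc_per_100g):
    """Convert hydrogen content in c.c. per 100 g of steel to ppm by weight."""
    moles_h2 = cc_per_100g / MOLAR_VOLUME_STP_CM3
    grams_h2 = moles_h2 * H2_MOLAR_MASS_G
    return grams_h2 / 100.0 * 1.0e6  # grams of hydrogen per 10^6 grams of steel


for content in (2.0, 4.0, 6.0):
    ppm = cc_per_100g_to_ppm(content)
    print(f"{content} c.c. per 100 g = {ppm:.2f} ppm by weight")
```

On these assumptions the 2 c.c. per 100 g limit is about 1.8 parts per million by weight, and the 4-6 c.c. per 100 g found in the steels as produced is roughly 3.6 to 5.4 parts per million.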

Recently ANDREW has published the results of a comprehensive 
study of the relationship between hydrogen and transformation 
characteristics in steel. Crack formation and embrittlement were found 
to be closely associated with each other, and with the general theme 
of the study, but the amount of hydrogen required for embrittlement 
is less than that needed for hair-line crack formation. In each case the 
crack formation is due to “disruptive hydrogen pressure being built up 
in internal cavities or voids.” This pressure would be caused by the 
diffusion of atomic hydrogen into any imperfection in the lattice which 
would hold more than one hydrogen molecule. It was also suggested 
that the reaction between hydrogen and iron carbide might create a 
pressure of methane sufficient to rupture the lattice. Finally, ANDREW 
emphasized that hydrogen diffusivity and solubility are the two main 
factors determining the susceptibility of a given steel to hair-line 
cracking. Unless the effect on these factors of structure and stresses 
in the steel is known, full understanding of the phenomenon cannot be 
reached. The results of some work on diffusion have now been published 
by CHANG and BENNETT, who investigated the effects of 
chromium, nickel and molybdenum on the rate of diffusion of hydrogen 
in iron. It is to be hoped that this is the first of a series of experimental 
studies carefully planned to reveal the fundamental reason for the 
phenomena. 

In another paper, ANDREW and LEE discuss the hydrogen phenomena 
in steel on the basis of internal pressures caused by the hydrogen. 
It is suggested that at high temperatures the hydrogen is retained in 
the lattice in atomic form in solid solution; at lower temperatures, however, 
the hydrogen may be occluded in molecular form at the crystalline 
boundaries. If the hydrogen is precipitated from solution at high 
temperatures, plastic yielding will relieve the stresses. At lower 
temperatures the precipitating hydrogen creates internal stresses in the 
metal. Once the pockets of molecular hydrogen have been formed, 
further diffusion is impossible. It is claimed that the concept of 
molecular hydrogen concentration at the lattice imperfections (crystal- 
lite boundaries) must be feasible since it could explain why the composi- 
tion and thermal history of the steel control the amount of hydrogen 
present in it. It was shown how the amount of hydrogen retained by a 
steel specimen increased with the increasing hardness of the specimen 
produced by rolling. This cold work would increase the volume of 
crystallite boundaries, and, therefore, the available volume for mole- 
cular hydrogen precipitation. Just as a plea was made for more funda- 
mental research on diffusion, so a need was expressed for a clearer 
understanding of the effects of composition and treatment on the 
crystalline substructures. 

HOBSON and SYKES have confirmed a correlation between the 
ductility and the hydrogen content of low-alloy steels. Hydrogen was 
found to have little effect on the shape of the stress-strain curve, but 
fracture was caused before elongation and reduction in area had 
reached normal values. As in the other papers mentioned, it is stressed 
that the hydrogen content alone does not determine whether or not 
hair-line cracks will form, but it is a major factor. 


The author wishes to thank Dr. J. N. Pratt, who read the first draft 
of this chapter and made many helpful suggestions. He also wishes to 

4 
THE THEORY OF SINTERING 


G. A. Geach 


POWDERS of most metals will sinter when heated to about three-quarters 
of their absolute melting points while protected from oxidation 
or other gaseous attack. Permanent adherence occurs between the 
particles, the whole mass becomes less porous and its other properties 
change towards those of the dense metal. A great many studies of these 
changes have been published, but in almost every case the cold metal 
powder was mechanically compressed before sintering. This, of course, 
is the usual procedure when sintering is applied in industrial powder 
metallurgy. Each such study has taken into consideration only some of 
the factors which can affect the progress of sintering. Published work is 
reviewed by KIEFFER and HOTOP and, comprehensively, by GOETZEL. 
Pressing increases the initial areas of contact between particles and 
reduces the porosity before sintering; it is usual to reach a far lower 
porosity when sintering a compressed powder than if a loose powder is 
used. Only very few studies of the sintering of unpressed metal powders 
have been published (SCHLECHT, SCHUBARDT and DUFTSCHMID; 
DELISLE; SHALER and WULFF) but from these and from other 
related work there is no suggestion that the basic course of sintering is 
modified by compression. 

Most metals are known to sinter, although those of quite low melting 
points, and a few metals of normal melting points but complicated 
structure and, perhaps, largely covalent binding, may be exceptions. 
Thus, germanium does not sinter even at 850°C (90 per cent of its 
melting point). HÜTTIG and HENNIG have suggested it to be a general 
rule that all metals sinter appreciably at about three-quarters of their 
absolute melting points if interference by extraneous factors is avoided. 
This tends to give too high sintering temperatures for metals of normal or 
high melting points; if it applies to metals of low melting points cad- 
mium and lead should sinter at 200°C and tin at about 100°C. No 
sintering at all has been observed in tin. SAUERWALD could find no 
adherence between zinc particles at 360°C, although Wo.tF*® observed 
coherence when zinc was extruded through a die at 300°C. KIKUCHI 
compared the hardness of metal powder pressings after heating them to 
different temperatures and, as would be expected, found the hardnesses 
of the worked metals to fall after increasing the temperature. In the 
case of zinc, however, the hardness rose slightly again after a minimum 
at 400°C. Silver also had a minimum hardness at 700°C, which is a 
conventional sintering temperature for this metal. It is probable that 
with both metals some bonding of the actual particles commenced at 
the temperatures of minimum hardness. 
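
The three-quarters rule is stated in terms of absolute temperature, so the conversion to and from the Celsius scale matters for the low melting-point metals. A minimal sketch follows; the melting points are rounded modern handbook values supplied here as assumptions, not figures from the works cited, and the function names are illustrative only.

```python
# Assumed melting points in degrees Celsius (rounded handbook values).
MELTING_POINTS_C = {
    "cadmium": 321.1,
    "lead": 327.5,
    "tin": 231.9,
    "zinc": 419.5,
    "silver": 961.8,
}


def to_kelvin(temp_c):
    """Convert degrees Celsius to kelvin."""
    return temp_c + 273.15


def rule_temperature_c(melting_point_c, fraction=0.75):
    """Temperature in degC equal to the given fraction of the absolute melting point."""
    return fraction * to_kelvin(melting_point_c) - 273.15


def homologous_temperature(temp_c, melting_point_c):
    """Temperature expressed as a fraction of the absolute melting point."""
    return to_kelvin(temp_c) / to_kelvin(melting_point_c)


for metal in ("cadmium", "lead", "tin"):
    t_rule = rule_temperature_c(MELTING_POINTS_C[metal])
    print(f"{metal}: three-quarters of the melting point is about {t_rule:.0f} degC")

# Temperatures of minimum hardness quoted in the text, as homologous temperatures.
for metal, temp_c in (("zinc", 400.0), ("silver", 700.0)):
    ratio = homologous_temperature(temp_c, MELTING_POINTS_C[metal])
    print(f"{metal} at {temp_c:.0f} degC: {ratio:.2f} of the absolute melting point")
```

With these assumed melting points the rule gives about 173°C for cadmium, 177°C for lead and 106°C for tin, which is the order of the round figures quoted above. The hardness minima fall at about 0.97 of the absolute melting point for zinc and about 0.79 for silver.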

The extent to which oxide surface films interfere with the sintering 
of the low melting-point metals is not clear; available data indicates 
that interference of this origin is at most only partial. Lead, for ex- 
ample, has oxides which are reduced by carbon monoxide at very low 
temperatures, but lead powder does not sinter perceptibly when heated 
to within 5°C of its melting point for 7 hours in atmospheres of hydrogen 
or carbon monoxide, or in vacuum. Comparison of the behaviour on 
heating of zinc powder prepared by filing sticks of zinc under carbon 
tetrachloride, and never exposed to air, with that of similar filings which 
were left in the atmosphere for an hour before heating, showed a distinctly 
greater coherence in the unoxidized material after 2½ hours at 
380°C (95 per cent of its melting point) but the extent of this sintering 
was not comparable with that occurring in copper and similar metals 
at 50 per cent of their melting points. 

Many inorganic substances such as silica, refractory oxides and 
silicates, and halides of high melting points also sinter. So, also, do 
powdered glasses. Sulphur, silicon, carbon and pure organic materials 
which have low melting points and contain rather large permanent 
molecules or structural units seem not to sinter. Sintering appears to be 
a characteristic of substances which, at the sintering temperatures, 
contain such small structural units as atoms or simple ions. 

Sintering may be considered to consist of two stages: one is the 
welding of particles which are in contact and the other the elimination 
of porosity. The welding is not remarkable and it is well known that 
absolutely clean surfaces of many materials adhere when brought 
together. The shrinkage, however, implies the rapid movement of 
matter into the pores and the principal requirement of any theory of 
sintering is that it explains this flow. 

At one time it was suggested that attractive forces between particles 
of a powder developed at some characteristic sintering temperature 
(SAUERWALD) but there is no evidence that such forces do depend 
upon temperature. Heating is required to effect the removal of obstruc- 
tions between particles and to make it possible for the area of interface 
to increase and the mass to grow less porous. The union of pieces of 
cold metals occurs in pressure welding which takes place when clean 
surfaces of such metals as aluminium, cadmium or copper are squeezed 
together. A true weld is obtained across the interface. The only prepara- 
tion required is cleaning the surfaces of oxide and grease; pressing 
serves to bring clean metal surfaces together over the welding area. 
It is immaterial to the formation of a weld whether recrystallization 
occurs across the interface. 
The earlier observation by BEILBY that filings of zinc will adhere in 
the cold to steel and those of silver to a silver surface may also be a 
demonstration that the attraction across an interface does not require 
an elevated temperature. 

Fig. 1. Changes in length of pressed copper compacts sintered 
isothermally in hydrogen 

The cohesive force between small particles of quartz and of sodium 
pyroborate was measured by BRADLEY, who found it to be about 
10 dynes between quartz spheres 0.05 cm in radius. This would be 
about the order of a tensile strength if the contact was 10~* cm across. 
BRADLEY found his results very sensitive to the atmosphere about his 
specimens and so, presumably, to the surface films upon them. The 
possibility that a direct attractive force between atoms in two particles 
might act over distances of much more than atomic dimensions has been 
examined by SHALER. He began with the Sommerfeld model of a 
metal as positive ions in an electron cloud and, for copper, deduced 
*a force of about 2 x 10° g/cm® acting across 120 A. The effect of 
temperature would be small. Action across such long distances was 
questioned by OROWAN, who pointed out that by such forces 
a 1 cm diameter copper sphere would support another hanging from it 
after they had touched. 
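
The comparison of BRADLEY’s cohesive force with a tensile strength amounts to dividing the force by an area of contact. The sketch below does this for a force of 10 dynes; the circular shape of the contact and the particular widths tried are assumptions made here for illustration, and the function name is hypothetical.

```python
import math

DYN_PER_CM2_TO_MPA = 1.0e-7  # 1 dyn/cm^2 = 0.1 Pa = 1e-7 MPa


def contact_stress(force_dyn, contact_width_cm):
    """Mean stress in dyn/cm^2 over a circular contact of the given width."""
    area_cm2 = math.pi * contact_width_cm ** 2 / 4.0
    return force_dyn / area_cm2


force = 10.0  # dynes, the cohesive force quoted for the quartz spheres
for width in (1.0e-3, 1.0e-4, 1.0e-5):  # assumed contact widths in cm
    stress = contact_stress(force, width)
    print(f"width {width:.0e} cm: {stress:.2e} dyn/cm^2 "
          f"({stress * DYN_PER_CM2_TO_MPA:.3g} MPa)")
```

Because the area varies as the square of the width, each tenfold reduction in the assumed contact width raises the calculated stress a hundredfold, so the comparison is very sensitive to the size of contact adopted.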


The properties in which changes during sintering are most simply, 
and have been most often, followed are the porosity of the specimen and 
those such as density or length which are related to it. At any given 
sintering temperature change is at first rapid and becomes gradually 
slower. 

Fig. 2. Densities of different specimens after sintering for 
fixed periods at different temperatures 

It is not clear whether, at constant sintering temperature, the porosity 
of a material tends always to zero as sintering is prolonged or whether it 
comes to a limiting value. SMITHELLS states that tungsten reaches a 
fixed density on sintering for about 15 minutes. It appears as though 
crystalline inorganic materials, such as refractory oxides, come to con- 
stant porosities, but that glasses have no such limits and tend continu- 
ously to zero porosity. The most sensitive measurements on metals 
show them to become continuously less porous during long experiments ; 
isothermal sintering curves for copper obtained by sensitive dilatometry 
show this for times up to 60 hours (Fig. 1). 

Whether porosity actually tends to a limit or in fact to zero, it does 
come to a roughly steady value in a few hours at practical sintering 
temperatures. This value depends very markedly upon that tempera- 
ture. Much of the published experimental work has reported such 
apparently limiting densities for different temperatures. Some typical 
examples of density, as a fraction of that of the compact metal, are 
plotted against sintering temperature, as a fraction of the absolute 
melting points, in Fig. 2. 

Results for iron are interesting as these show a very marked increase 
in porosity at the α-γ transformation temperature. SCHLECHT, SCHUBARDT 
and DUFTSCHMID sintered unpressed carbonyl iron powder and 
made metallographic sections after different sintering treatments, as 
well as measuring the practical limiting densities reached. Increase 
in pore size after passing the α-γ transformation temperature is clear in 
their micrographs; there was little rounding of the pores below 1200°C. 

DUWEZ has also examined iron and found that the rate of densification 
at 982°C was only one-thousandth part of the rate at 816°C. 
Shrinkage at 982°C could not, in fact, be detected after two months. 

SHALER made density measurements on specimens of sintered 
unpressed copper. In 6 hours at 930°C in vacuum the density rose from 
50 per cent to 95 per cent that of copper, and the remaining pores were 
sealed off from the surface. However, in similar experiments with 
atomized copper powder GEACH and JONES observed quite different 
rates of sintering, densities after 60 hours sintering at 900°C being 75 per 
cent that of copper and at 1000°C, 80 per cent. In their specimens no 
pores were closed off from the surface. Different results from such 
similar experiments may be largely due to differences in the powders 
used; in this instance SHALER used copper powder of a less uniform 
size. 

An initial expansion before the shrinkage normal to sintering is often 
noted. This is usually quite small and most commonly reported during 
slow sintering at low temperature. Thus it occurs in tungsten, which 
may increase in length by about 0.2 per cent at 600°C, and DRAPEAU 
obtained more considerable expansions of this sort (2 per cent in length) 
in pressed copper specimens heated for 2 to 4 hours at 760°C. The 
expansion is commonly said, but has not been shown, to be due to gases 
contained in the metal. That it does not depend, as BALSHIN suggested, 
on cold work and subsequent recrystallization was shown by SHALER 
and WULFF, who sintered unpressed copper to a density of 7.1 g/cm³ 
by heating for 16 hours at 850°C in argon, and on then heating the same 
specimen in vacuum for a further 8 hours found the density to fall to 
5.3 g/cm³. In this experiment any possible cold work in the material 
was annealed out during the first heating. 
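
The size of this expansion is easier to judge when the two densities are expressed as porosities. The following is a minimal sketch, assuming a handbook density of 8.96 g/cm³ for solid copper (a value not given in the text); the function name is illustrative only.

```python
# Porosity of a sintered copper compact from its bulk density.
RHO_CU = 8.96  # g/cm^3, handbook density of solid copper (assumed, not from the text)


def porosity(density, rho_solid=RHO_CU):
    """Return the pore volume fraction of a compact of the given bulk density."""
    return 1.0 - density / rho_solid


# Densities quoted for the argon treatment and the subsequent vacuum treatment.
for rho in (7.1, 5.3):
    print(f"{rho:.1f} g/cm^3 -> {100 * porosity(rho):.0f} per cent porosity")
```

On this assumption the second heating roughly doubles the pore volume of the specimen.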

CLARK and WHITE found that pressings of chromic oxide also
expanded during the first part of sintering at 1550°C; the density fell
from 2.16 g/cm³ to 2.10 g/cm³ in 16 hours, after which it rose again.
However, throughout this expansion, the strength of the pressing 
increased, so that bonding together of particles appeared to go on 
during the expansion. 

Measurements of mechanical properties of sintered powders have 
practical value, but generally little more; these properties are very 
sensitive to many factors in addition to sintering. HÜTTIG has
examined such properties of sintered metals as the rate of solution in an
acid or the amount of methyl alcohol vapour adsorbed; these properties
are all connected with the volume of pores open to the surface of the
specimen. KIEFFER and HOTOP have collected and compared many
properties of sintered and melted metals. 

It is noteworthy that the temperatures at which the changes in 
different properties of a material are most marked may vary quite 
widely. This is very clear on comparing electrical resistivity, which is 
sensitive to the early growth of interparticle welds, with density. Thus, 
HAUSNER and PINTO examined the sintering of beryllium in vacuum
and found that, while the resistivity fell abruptly at about 700°C 
(170 to 50 microhm-cm), the sharp increase in density took place 
above 1000°C. This difference in temperatures is quite understandable 
because it will be the early stages of sintering, when welds between 
particles develop, that will markedly influence resistivity, but the 
structural changes then occurring in the general mass will not be great. 
It is at higher temperatures that pore size is altered rapidly and that 
great changes occur in density. MYERS studied resistivity changes in
tantalum pressed at 7 × 10⁶ g/cm² (50 tsi) and sintered for 30 minutes
in vacuum. He obtained a marked drop in resistivity at 600°C and
smaller ones at 1200° and 2200°C. Density did not change appreciably
until this last temperature was reached. MYERS suggested that the
earlier stages are due to the removal of gases and impurities, but
HAUSNER and DEDRICK supposed the second step due to evaporation
of tantalum within pores and its condensation at necks. Resistivity is
very dependent upon particle size; thus for copper pressed at
1.4 × 10⁶ g/cm² (10 tsi) and sintered 14 hours in hydrogen, HAUSNER
gives the following resistivities:

Resistivities in Microhm-cm

Particle Size              Sintering Temperatures
                           600°C     800°C     1000°C

44 × 10⁻⁴ cm               5.37
2 × 10⁻⁴ cm                5.01      2.45      1.74
                                               1.97
44-74 × 10⁻⁴ cm            5.6       2.52

Resistivity values often give a smooth curve when plotted against 
sintered densities. 

Several workers have made metallographic examination of the welds 
between particles and in general found that no changes are to be 
detected during early stages of sintering when the compact as a whole 
gains strength and density. This is in agreement with the fact that 
crystal growth across the interface is not essential in pressure welding.
LIKHTMAN, however, reports the observation of minute bridges formed
between particles of electrolytic copper at temperatures as low as 
300°C. 

BIER and O'KEEFE found that in specimens of copper reduced from
the oxide and pressed at 30 tsi the original particles only ceased to be
identifiable on polished sections after sintering at quite high temperatures,
for instance after 30 minutes at 900°C. COOK and PUGH
examined the changes in the structure of three types of copper powder 
on sintering. They found that the larger original particles retained their 
identities after sintering at temperatures between 400° and 800°C, but 
at 800° to 1000°C the whole specimen acquired a fine-grained equiaxed 
and porous structure. Recrystallization generally occurred at much 
lower temperatures in the largest particles. 

In electrolytic iron, studied by LIBSCH, VOLTERRA and WULFF,
after pressing at 4.6 × 10⁶ g/cm² (33 tsi) and sintering for 4 hours in
hydrogen, little change could be seen at 500°C (density 6.4 g/cm³) but at
625°C (density 6.8 g/cm³) recrystallization in grains and boundaries
appeared. At 725°C (density 7.0 g/cm³) a new structure of uniform
equiaxed grains less in size than the original particles had developed, and
from 825°C (density 7.2 g/cm³) to 900°C (density 6.9 g/cm³) very marked
grain growth occurred. Increased porosity near the α-γ transformation
temperature was thought not to be due to the transformation itself, but
perhaps to liberation of gases in the metal; the increase appeared to be
spaced over a 100°C range about the transformation point. Powders
annealed before sintering changed less than similar material that had
been strained but not annealed; thus an electrolytic iron powder with
particles in a normal strained state increased in density from 3.0 to
3.7 g/cm³ on heating for 24 hours at 700°C but if the powder used was
first annealed no change in density took place under these conditions. 

Pores in the iron specimens appeared larger and were more rounded 
after sintering at 900°C than at lower temperatures. The fact that pores 
are usually more apparent as sintering proceeds probably arises because 
flattened porous areas around grains change to more noticeable round 
shapes. In a pressed powder the pores will tend to be flattened and 
to extend thinly normal to the pressing direction. As such pores become 
rounded shrinkage should be more marked normal to than along the 
pressing direction; this has been observed, for instance, by KAMM,
STEINBERG and WULFF for copper. STEINITZ associated changes in
the magnetic permeability of pressed iron powders when sintered with
the same change in pore shapes. Specimens pressed at 1.4 × 10⁶ to
14 × 10⁶ g/cm² (10 to 50 tsi) had permeabilities after sintering for
1 hour at 1150°C which agreed with theory for a mass having the
porosity distributed as flat discs. Specimens sintered at 1250° to 1350°C
for 24 hours had higher permeabilities than those of the same densities
obtained at lower sintering temperatures, but on re-pressing the
permeability fell back to the "flat-pore" value.

Small pores disappear before larger ones; for example RHINES,
BIRCHENALL and HUGHES found that the average pore size in copper
powders increases on sintering. The number of pores of cross sections 
of 9 x 10-* and 12 x 10-* cm? in 0-0293 cm? of surface after sintering 
in hydrogen are given in the table to illustrate these points, and Fig. 3
shows the general effect. Results are rather more erratic but the general
trends are the same when sintering in argon or in vacuum.

Fig. 3. The population distribution of pores in copper compacts
after sintering in hydrogen at 1000°C


Number of Pores in 0.0293 cm² of Section

                          Sintering             Sintering Time (Hours)
                          Temperature (°C)

9 x 10-* cm² pores        800                   1496
                          900                   1654
                          1000                  1415

12 x 10-* cm² pores       800
                          900
                          1000


These workers also reported that regions near the outer surfaces of a 
specimen appeared to become free from pores before the centre did. 
GREENOUGH has made a very interesting observation on the
behaviour of small "blow holes" (about 0.001 cm across) in silver. The
specimen was cold rolled sheet in which the holes were randomly 
distributed. After it had been heated for 19 hours in nitrogen at 900°C 
almost all the blow holes were collected on the new grain boundaries of 
the metal. GREENOUGH also noted that the average size of these holes 
had increased. He suggested that the holes move with boundaries, 
part of the area of which they eliminate, but it is not at all clear how in 
this phenomenon material crosses a moving hole. In the same specimens 
large grooves along boundaries at the surface of the silver and a depression
of some grains below the level of others were noted. This implies
that holes move along boundaries, intact or as vacant sites; apparently
one grain dominates the supply of atoms to move a hole. GREENOUGH 
questioned the validity of any assumption of random distribution of 
pores on sintering; he believed that, if sintering involved a plastic flow 
of matter, a body with pores distributed on planes in this way should 
increase in density more rapidly than one in which they were randomly 
distributed. 

Experiments in which copper wires from 0.005 to 0.050 cm diameter
were wound in close-packed arrangement on bobbins, heated in hydrogen
and then sectioned on a diameter for metallographic examination have
been made by GEACH and JONES and by ALEXANDER and BALLUFFI;
these gave pictures resembling an ideal section through contacts
between close-packed spherical particles of powder. The former workers
found the original boundaries between wires to be surprisingly permanent
and to remain generally visible long after the grains within the
wires were of the same order of cross section as the wires. The rounding
of the "triangular" spaces between wires depended very markedly on
wire diameters. Thus at 1000°C the pores between 0.005 cm diameter
wires became circular in section in 6 hours, but between 0.013 cm
diameter and larger wires they remained clearly triangular. The fifth
power of the width of the junctions between wires was roughly proportional
to the time of heating and also to the square of the diameter of
the wire. Contraction of pore area was slower than that occurring in
powders. ALEXANDER and BALLUFFI found a cavity to shrink so long as
a grain boundary ran into it; when this was no longer the case the 
cavity would still grow more rounded in shape, but no further change 
in area occurred. As boundaries disappear more quickly at higher 
temperatures the pores did not shrink so much, although what change 
there was took place more quickly. The same authors recorded that the 
rate of sintering of silver powder is sensitive to the grain size and the 
consequent concentration of boundaries. 

The behaviour of synthetic pores consisting of holes 0.016, 0.022 or
0.026 cm radius and 1 cm long drilled in small bars of copper, when these
were heated in vacuo, has been examined by POSTLETHWAITE and
SHALER; the specimens stood vertically while heating. The radii
fell to minimum values after times of 30 to 70 hours heating and rose
again; in some instances there was an initial increase before dropping to
the lowest value. Typical values are:





Time (Hours) 


(cm) 


1000°C 0-0260 -262 0-263 0-260 
1050°C ‘ 0-0262 26 0-259 0-266 





KUCZYNSKI and KUCZYNSKI and ZAVARINE studied sections
through the joints between round particles of copper, silver or glass and 
flat surfaces of the same materials on which they had been heated 
(Fig. 4). A boundary line at the interface generally remained visible 
and, in some cases, it appeared to have moved slightly into a copper 
particle which then stood raised on a slight pedestal above the flat 
surface. However, as the particles were not strictly round in section it 
is difficult to be certain of this. The sharp angles at points of contact 
became rounded as the width of these interfaces grew; KUCZYNSKI
measured these widths. These experiments must have been troublesome,
as rounded particles had to be found and the sections had to be
taken through the centres of the junctions. The glass particles examined 
were more nearly round than the metals and showed both the rounding 
of angles and the build up of matter above the original flat surface. 

In one of the sections published by KUCZYNSKI the flat surface of the
copper block appears to have a shallow "moat" around the interface;
this specimen had been heated for 4 hours at 700°C in hydrogen, the
copper particle having a radius of about 0.003 cm; however, the
available data cannot justify an assumption that this structure is usual
if, perhaps, commonly less apparent. KUCZYNSKI obtained most
interesting sections through the junctions between a copper particle and
a nickel plane or a nickel particle and copper plane (Fig. 5). In each
case a very deep groove appears in the copper around the junction. The
formation of these grooves would be easy to understand if rapid surface
diffusion of the copper atoms might be assumed; the convergence of the
groove as it reaches deeper into the copper might also follow, for the 
mouth of the opening must move out from the original point of contact 
as the area of the weld increases. It is not apparent why any other 
mechanism should give vacancies condensing to single large cavities 
open to the surface of the copper but penetrating deeply into it. 

A single layer of 0.03 cm radius copper particles was sintered by
DEDRICK and GERDS using temperatures from 700° to 1025°C. The
widths of the interfaces between particles were measured and varied
from 0.0036 to 0.0122 cm in experiments with sintering times up to
100 hours, and it was found that the fifth power of the ratio of this
diameter to that of the particles gave a straight line which passed
through the origin if plotted against sintering time. SHALER has also
examined the growth of the junctions between graded copper spheres
0.15 to 0.20 cm in diameter. These were first annealed and then
arranged in contact in a single row and heated in hydrogen. SHALER 
plotted the angle subtended at the centres of particles by the radii of 
the junctions against sintering times; this gave curves with linear 
portions for long sintering times but these did not extrapolate through 
the origin. 
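
A power law of this kind can be tested by fitting the exponent n in (x/a)^n = kt on logarithmic axes, where x is the junction width and a the particle diameter. The sketch below is illustrative only: the data are synthetic, generated from a hypothetical rate constant k, and are not the measurements of any of the workers cited; the function name is likewise an assumption.

```python
import math


def fit_power_law_exponent(times, ratios):
    """Least-squares fit of log(ratio) against log(time); returns n in ratio**n = k*t."""
    xs = [math.log(t) for t in times]
    ys = [math.log(r) for r in ratios]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    return 1.0 / slope  # the slope of the log-log plot is 1/n


# Synthetic junction-width ratios obeying a fifth-power law (hypothetical k).
k = 1.0e-6                      # per hour, illustrative only
times = [1, 5, 10, 50, 100]     # hours
ratios = [(k * t) ** 0.2 for t in times]

exponent = fit_power_law_exponent(times, ratios)
print(f"fitted exponent n = {exponent:.2f}")
```

With real measurements the fitted exponent would carry experimental scatter, and a plot that fails to pass through the origin would show up as curvature on the logarithmic axes.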

The rate of sintering of a material is generally very sensitive to the 
nature of the atmosphere. Metal powders commonly reach any given 
density at a lower temperature or, at a given temperature, in less time 
in hydrogen or other reducing atmosphere than in argon or vacuum. 
Thus at 980°C copper reached the same density in about 2 hours in
hydrogen as after 16 hours in a "vacuum" of 0.5 mm Hg. In the case
of beryllium, HAUSNER and PINTO found that after pressing beryllium
at 2.1 × 10⁶ g/cm² and sintering for 3 hours at 1200°C it had a density
of 1.14 g/cm³ when the sintering was in argon, but of 1.66 g/cm³ when
it was done in air under a pressure of 0.0005 mm Hg; a less marked
difference was found when using specimens which had been subjected to
higher pressures. A similar sensitivity of refractory oxides, such as
beryllium oxide, to the atmosphere in which they are sintered may, it
has been suggested, be due to easier diffusion following the production
of vacant oxygen sites by surface reaction with the atmosphere. Zinc
oxide and titanium oxide depart from stoichiometry in air at sintering
temperatures, but alumina does not; the first two sinter at two-thirds
of the melting point absolute, the latter at about 0.85 of the melting
point. 

LIKHTMAN attributed the formation of small interparticle bridges
on heating copper at very low temperatures to mobile copper atoms
formed as the surface oxide is reduced, or to evaporation and condensation
of the metal. He observed that in sealed-off evacuated apparatus
in which copper was heated to 300° or 400°C, visible films of copper 
formed on liquid nitrogen traps. 

The rate of sintering is affected by powder particle size. Fine powders 
usually sinter at rather lower temperatures than coarse ones, although, 
as the finer powder may give the less dense pressing, it may still be the 
more porous metal after a given sintering treatment. Uniformly sized 
powder does not sinter as rapidly as a graded mixture; results for 
electrolytic iron powders pressed at 4.6 × 10⁶ g/cm² (33 tsi) and sintered
for 4 hours at 825°C in hydrogen illustrating these points are given by
LIBSCH, VOLTERRA and WULFF:

Powder Size (Mesh)                              Tensile Strength   Density
                                                (10⁶ g/cm²)        (g/cm³)

(A) 100-200                                     2.01
(B) 200-325                                     1.86
(C) 325-500                                     2.11
4 parts (A) + 1 part each of (B) and (C)        2.18

Fig. 4. Copper sphere sintered to a plane copper surface by heating in
hydrogen for one hour at 900°C. Magnification 770 (KUCZYNSKI)

Fig. 5. Sections through: Left, a copper particle sintered to a nickel
plane by 24 hours heating at 950°C in hydrogen; magnification 770.
Right, a nickel particle sintered to copper by 45 hours heating at 1020°C
in hydrogen; magnification 220. Black areas are pores, not rounded in
nickel (KUCZYNSKI)


Gold of such fine particle size as 10~* cm sinters at 150°C as shown by 
changes in its resistivity and in its infra-red reflectance which, it is 
shown, are related (HARRIS, JEFFRIES and SIEGEL).

These very low sintering temperatures were found by using particles 
of gold freshly made by condensation of the vapour; if the material
were first "stabilized" by temperatures below 80°C it sintered less
readily. It was suggested that the particles are in an unstable state when
freshly prepared and that stability follows a diffusion of vacant sites to
the surface; an activation energy for this stabilization process of
23,500 ± 1300 cal/g atom between 50° and 70°C was found.
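
An activation energy of this size implies a strong temperature dependence even over the narrow range studied. The following sketch, assuming simple Arrhenius behaviour and the usual gas constant of 1.987 cal/(mol K), estimates how much faster the stabilization would proceed at 70°C than at 50°C; the function name is illustrative only.

```python
import math

R_GAS = 1.987  # cal/(mol K), molar gas constant


def arrhenius_rate_ratio(q_cal, t_low_c, t_high_c):
    """Ratio of rates at two Celsius temperatures for activation energy q_cal (cal/g atom)."""
    t_low = t_low_c + 273.15
    t_high = t_high_c + 273.15
    return math.exp(q_cal / R_GAS * (1.0 / t_low - 1.0 / t_high))


# Central value and the quoted limits of the activation energy.
for q in (23_500 - 1_300, 23_500, 23_500 + 1_300):
    print(f"Q = {q} cal/g atom: rate ratio = {arrhenius_rate_ratio(q, 50, 70):.1f}")
```

On these assumptions a rise of only 20°C speeds the process roughly eightfold.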

KALISCHER gave expansion-contraction curves for annealed and
unannealed nickel powder compacts when heating in hydrogen at
13.3°C per minute. The unannealed powder began to contract at a lower
temperature and shrank more than the annealed powder. 

WARREN and LIBSCH examined the sintering of 50/50 iron/cobalt
mixtures. A specimen heated rapidly to 1400°C and sintered had more
pores and larger pores than a similar specimen first heated between 800°
and 1400°C before sintering. DELISLE obtained some indication that
loose iron powder sintered less readily if annealed than if strained. 

Similarly, pressing a powder causes sintering to occur more readily and 
leads to a lower porosity after any given sintering treatment. Typical 
results are those for molybdenum obtained by GRUBE and SCHLECHT.
After sintering for 3 hours in hydrogen they obtained the following 
densities : 


Pressure (kg/cm²)                  1000    2000    7000
Density (g/cm³) after 1000°C       4.6     5.6     7.6
Density (g/cm³) after 1700°C       9.4     9.5     10.0


Slightly greater porosity found in some metals after pressing at very
high pressure is probably due to expansion by enclosed gas. The effects
of pressure on several metals are reported and surveyed by BALKE.
The application of pressure during sintering has a very marked effect.
LAURENT and EUDIER measured the widths of the junctions between
0.06 cm diameter copper spheres and plane surfaces after heating in
Kuczynski-type experiments. On heating for 100 minutes at 1050°C in
cracked ammonia a junction with a diameter of 0.014 cm was formed;
if a load of 10 gm pressed the sphere on to the plane the same heating
gave a diameter of 0.036 cm. Loads reaching up to this value are produced
by the pressures used in technical hot pressing practice.

Copper was pressed cold under a pressure of 8.8 × 10⁵ g/cm²
(12,500 psi) to make compacts which were then sintered in hydrogen
while under loads of up to 1.16 × 10⁴ g/cm² by RHINES and CANNON.
No flattening of pores could be discovered after sintering at 1000°C. 
Density increased more rapidly on loading, and in such a way that the 
logarithm of the time required to reach any chosen density was related 
linearly to the stress used. Typical results for a density equal to 95 per 
cent of the bulk density of copper are: 


Time (Hours)             38.0     27.2      14.5      4.0
Stress (g/cm²)           0        700       2100      4900
log (t₀/t) / stress      -        0.0002    0.0002    0.0002

Here t is the time to reach the density with the specimen under stress
and t₀ the time when it is under no load. The function in the third row
is constant for a linear relationship.
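
The constancy of the third row can be checked directly from the times and stresses tabulated above. The following is a minimal sketch, assuming that the logarithm is taken to base 10; the variable names are illustrative only.

```python
import math

# Times (hours) to reach 95 per cent density and the stresses (g/cm^2) applied.
times = [38.0, 27.2, 14.5, 4.0]
stresses = [0, 700, 2100, 4900]

t0 = times[0]  # time under no load

# log10(t0/t) divided by the stress, for each loaded specimen.
constants = [math.log10(t0 / t) / s for t, s in zip(times[1:], stresses[1:])]

for s, c in zip(stresses[1:], constants):
    print(f"{s:5d} g/cm^2: log(t0/t)/stress = {c:.5f}")
```

All three loaded specimens give a value close to 0.0002, as a linear relation between the logarithm of the time and the stress requires.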

Any given load produces a constant proportional increase in rate for 
all times of sintering. A “densification parameter” § = (q,— q,)/4,, 
where g, and q, are the porosity of specimens given the same sintering 
treatment unloaded and loaded, respectively, is independent of time for 
each load. Loading within the range studied does not, then, introduce 
a fresh mechanism into the processes of sintering. The effect of stress 
on sintering found in this work is, as RHINES and CANNON point out,
similar to that of stress on creep rate. 

It is now generally assumed that reduction of surface area and so of 
surface energy provides the driving energy for sintering. The energy so 
provided is small; early speculations have considered other possibilities 
but it seems that these can make only incidental contributions to the 
cause of sintering. The surface energy of copper powder with particles
of 0.005 cm radius would be about 2 × 10⁻² cal/g (0.1 joules/g). If
copper powder were compacted by hot pressing, a pressure of about
10⁴ g/cm² (200 lb/in.²) would be required to supply about the same
energy to the specimen.

The energy introduced into the powder particles by cold pressing and
made available on heating has also been suggested as a factor in sintering.
BALSHIN supposed that recrystallization occurs by a re-organization
following cold work and that individual atoms diffuse to nuclei
which grow at the junctions between particles. The fact that the
greater part of sintering usually occurs at times and temperatures
beyond those needed for full annealing, and that much densification
takes place after this anneal, makes any large contribution from
recrystallization improbable.

SAUERWALD, in an early speculation, suggested that at some characteristic
temperature which is about three-quarters of the absolute
melting point and not affected by pressing, an interchange of atoms
occurs between particles and leads to grain growth. Raising the temperature
should then aid adhesion by increasing atom mobility and by
smoothing away surface irregularities.

SMITH, in another early speculation, suggested that sintering might
depend upon changes in structure from amorphous to crystalline or
from one crystal form to another, these presumably being supposed to
free atoms for individual movement. SMITH's own determinations of the
lowest temperatures at which sintering of unpressed powders of different
metals could be detected by prodding with a wire hardly support
the view; he detected sintering in iron at 250°C, in copper at 250°C, 
silver at 180°C and molybdenum at 800°C, but none in manganese 
or tin. 

Surface tension at a pore might induce local flow or surface diffusion, 
but density changes require flow away from the pores at positions 
where any direct effects due to surface tension are "diluted" away.
Considerable interest now centres on values of the surface tensions of
solids and of the energies of grain boundary areas. Recent work has
been reviewed by FISHER and DUNN. In one method of determining
the surface energy of solids, weights are suspended from foils or wires
which are heated, and the weight at which no change in length occurs is
determined. This is the force required to balance the effect of surface
and grain boundary tensions. UDIN has shown that the grain boundary
effect in such experiments is not, as earlier workers supposed, trifling,
but that it can be dealt with in wires of "bamboo-like" structure, that
is, having grain boundaries normal to their axis and occupying the full
cross section. If $W$ is the load giving no change in length, UDIN puts

$$W = \pi R\gamma - N\pi R^{2}\gamma_{B}$$

where $R$ is the radius of the wire, $N$ the number of grains per unit length,
$\gamma$ the surface tension and $\gamma_{B}$ the grain boundary energy. He believes
that $\gamma_{B} = \gamma/3$ and thus
deduces from the experimental results of UDIN, SHALER and WULFF
that copper has a surface tension of 1650 ± 100 dynes/cm and a temperature
coefficient of −0.55 dynes/cm/°C. By the same procedure
values of 1140 ± 90 dynes/cm for silver between 875° and 932°C
and of 1400 ± 65 dynes/cm for gold over the range 1017° to 1042°C
have been given. The atmosphere appears to have a real effect; the
above data were obtained in helium; one run in air gave a value for
gold of 1207 ± 75 dynes/cm.
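
The way a surface tension follows from a balancing load can be sketched numerically. The example below takes the balance in the form W = πRγ − NπR²γ_B with γ_B = γ/3, as described above, and solves it for γ. The wire radius and grain count are hypothetical values chosen for illustration, not data from the experiments cited, and the function names are assumptions.

```python
import math


def balance_load(gamma, radius, grains_per_cm, gb_ratio=1.0 / 3.0):
    """Load (dynes) giving no change in length: W = pi*R*gamma - N*pi*R^2*gamma_B."""
    gamma_b = gb_ratio * gamma
    return math.pi * radius * gamma - grains_per_cm * math.pi * radius ** 2 * gamma_b


def surface_tension(load, radius, grains_per_cm, gb_ratio=1.0 / 3.0):
    """Invert the balance for gamma (dynes/cm), taking gamma_B = gb_ratio * gamma."""
    return load / (math.pi * radius * (1.0 - gb_ratio * grains_per_cm * radius))


# Hypothetical wire: radius 0.0064 cm with 50 grains per cm of length.
radius = 0.0064
grains_per_cm = 50

load = balance_load(1650.0, radius, grains_per_cm)
print(f"balancing load  = {load:.2f} dynes")
print(f"recovered gamma = {surface_tension(load, radius, grains_per_cm):.0f} dynes/cm")
```

For this hypothetical wire, neglecting the grain boundary term would understate the surface tension by about a tenth, which illustrates why the correction is not trifling.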




By another method BAILEY and WATKINS obtained values for copper.
They prepared grooves along the grain boundaries of copper by thermal
etching at 800° to 900°C and measured the angles at the bottom of these
grooves between their sides. This angle was measured on specimens
which had been heated in argon ($\phi_{1}$), and on others heated in argon and
lead vapour ($\phi_{2}$), and the contact angle ($\phi_{3}$) between copper and molten
lead was also measured. By balancing surface tension forces one obtains
the relation

$$\gamma_{\mathrm{Cu}} = (\gamma_{\mathrm{Cu/Pb}} + \gamma_{\mathrm{Pb/vapour}} \cos\phi_{3})\,\frac{\cos(\phi_{2}/2)}{\cos(\phi_{1}/2)}$$

The quantities on the right of this are known or measured in the
experiments. Results again show that the atmosphere has an effect:

800°C in hydrogen      1750 dynes/cm
900°C in hydrogen      2000 dynes/cm
900°C in argon         1600 dynes/cm


Results from these two methods are in general agreement. RYMER and
BUTLER calculated a quite different value of 500 dynes/cm for gold in
order to explain the anomalous surface lattice spacings measured by 
electron diffraction. 

Considerations of surface energy show that a pore should close up if 
the surface energy of the material is more than half of that of the 
grain boundary which would be formed; for copper, it appears to be 
two or three times as great. 

Sintering may be considered as a simple rate process. If the logarithms 
of the times to reach a certain stage in the process, that is for some 
property to reach some special value, are plotted against the reciprocal 
of the temperature a straight line is obtained if the Arrhenius equation is 
applicable. For such a rate process a definite activation energy exists. 
HÜTTIG measured the electrical resistivity of electrolytic copper
powder after various times at different temperatures in hydrogen and
obtained fair linear plots of this type, taking the times required to reach
selected resistivities. He obtained an activation energy of 56,000 cal/g
atom. DUWEZ treated porosity data for copper in the same way and
obtained an activation energy of 80,000 cal/g atom for sintering in
hydrogen, but for the same copper sintered in vacuum he did not obtain
the required straight line plots of log time against 1/T. This would
seem to mean that more than one activation process is involved in the 
sintering. The activation energy is shown by Duwez’s curves to fall as 
the sintering temperature is increased. 
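
The procedure described above amounts to a straight-line fit of the logarithm of the time against the reciprocal of the absolute temperature, the slope being the activation energy divided by the gas constant. The sketch below illustrates this with synthetic times generated from an assumed activation energy of 56,000 cal/g atom and a hypothetical pre-exponential factor; it does not reproduce any published data.

```python
import math

R_GAS = 1.987  # cal/(mol K), molar gas constant


def activation_energy(temps_c, times):
    """Fit ln(time) against 1/T (K) by least squares; return Q in cal/g atom."""
    xs = [1.0 / (t + 273.15) for t in temps_c]
    ys = [math.log(t) for t in times]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    return slope * R_GAS  # slope of ln(time) against 1/T is Q/R


# Synthetic times to reach a chosen property value (hypothetical prefactor).
q_assumed = 56_000.0
temps_c = [700, 750, 800, 850, 900]
times = [1.0e-10 * math.exp(q_assumed / (R_GAS * (t + 273.15))) for t in temps_c]

print(f"fitted Q = {activation_energy(temps_c, times):.0f} cal/g atom")
```

Real data that do not fall on a straight line in such a plot, as in the vacuum results mentioned above, would give a slope that changes with temperature rather than a single activation energy.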

FRENKEL suggested that sintering solids flow as Newtonian viscous
liquids, that is, so that the strain rate is linearly related to stress, surface
tension being the driving force. The viscous flow is supposed to arise
from the diffusion of individual vacant atom sites and is thus distinct
from plastic flow which is due to the orderly slipping of atomic layers in
a crystal. He compares the sintering of two spheres with the merging
of two drops of liquid. Equating the decrease of surface energy
released to the energy dissipated as a result of viscous flow, he deduces
that the area of contact, $\pi r^{2}$, should increase at the steady rate
$3\pi R\gamma/2\eta$, where $R$ is the radius of the spheres, $\gamma$ the surface
tension and $\eta$ the coefficient of viscosity. FRENKEL also found that gas
pressure in a pore will stop its shrinkage when this equals the pressure
associated with surface tension; the pore then has its minimum radius,
$(PX^{3}/2\gamma)^{1/2}$,
$X$ being the original pore radius and $P$ the gas pressure. He showed
how an activation energy may be derived as already described, and
believes that it is the activation energy of self diffusion. SHALER and
WULFF used FRENKEL's model and calculated an activation energy
of 85,000 cal/g atom for copper; this value is rather higher than those
obtained directly for diffusion (60,000 cal/g atom). For the viscosity
coefficient of a metal, $\eta$, FRENKEL gave the expression $\eta = kT/Da$,
where $k$ is Boltzmann's constant, $T$ is the temperature, $a$ the lattice
constant and $D$ the self-diffusion coefficient. Taking published values
for $D$ this gave for copper
at 1000°C $\eta = 3.1 \times 10^{3}$ poises, a viscosity roughly comparable with
Golden Syrup at room temperature!
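
The order of magnitude of this viscosity can be checked from Frenkel's expression, taken here as η = kT/(Da) with k Boltzmann's constant. In the sketch below the self-diffusion coefficient of copper at 1000°C is an assumed round figure of 1.6 × 10⁻⁹ cm²/sec, not a value quoted in the text, and the lattice constant is the handbook value for copper; the function name is illustrative only.

```python
K_BOLTZMANN = 1.380649e-16  # erg/K (CGS units)


def frenkel_viscosity(temp_k, diffusivity, lattice_constant):
    """Viscosity in poises from eta = k*T / (D*a), all quantities in CGS units."""
    return K_BOLTZMANN * temp_k / (diffusivity * lattice_constant)


temp_k = 1000.0 + 273.15    # 1000 deg C expressed in kelvin
d_self = 1.6e-9             # cm^2/sec, assumed self-diffusion coefficient of copper
a_cu = 3.61e-8              # cm, lattice constant of copper

eta = frenkel_viscosity(temp_k, d_self, a_cu)
print(f"eta = {eta:.2e} poises")
```

With these assumed inputs the result is a few thousand poises; the exact figure depends directly on the value adopted for the diffusion coefficient.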

NABARRO has criticized FRENKEL's treatment in that the movements
of lattice vacancies do not provide a mechanism for viscous flow.
Thus if a vacancy in a row of atoms moves from left to right the atoms 
in that row all move a distance of one vacancy from right to left relative 
to the rows of atoms above and below, but these two rows remain un- 
moved relative to each other, which means that there has been no 
movement of one large part of the crystal relative to the other. FRENKEL's
treatment appears to require such a movement. NABARRO showed
that, to be viscous, a crystal must contain mosaic structure or other 
imperfection in addition to vacant sites and its coefficient of viscosity 
will be about 10* times that given by FRENKEL’s expression ; this brings 
the value for copper at 1000°C to the order of that of pitch at 0°C. 

ALEXANDER, KUCZYNSKI and DAWSON considered, from FRENKEL's
point of view, data for the creep of gold wires (0.001 in. diameter) at
970°C. They plotted strain against time for various loads and taking 
the strain rate from the most linear part of the curves plotted this 
against stress, obtaining a straight line. This meant that under these 
limited circumstances flow was viscous, and its coefficient of viscosity 
came out as 2-36 x 10" poises; it also gives a surface tension for gold of 
1280 dynes/cm. The linear parts of the strain against time curves
were given by the data for longer times; the fact that the early parts 
are steeper and depart from linearity implies that during sintering flow is
not at first purely viscous and that the strain rate is higher than for the
apparently viscous conditions.

Using the same data and plotting initial strain rate against stress 
gave a curve consisting of two straight lines intersecting at what should 
be a critical shear stress: it has a value of 7 kg/mm² (the measured
value at room temperature is 9.2 kg/mm²). The part of the curve below
this stress corresponds to viscous flow and a viscosity of 1-4 x 10" 
poises. Above the critical stress initial strain rate is again linear with

Fig. 6. CLARK and WHITE's model of the sintering of two spherical
particles; the shaded section is through the "lens" forming the junction

stress but it increases more quickly and the coefficient of viscosity is 
1-5 x 10" poises. It is suggested that at low stresses viscous flow 
occurs by vacancy migration but for higher stresses a formula obtained 
by NABARRO and connecting strain rate with the size of a mosaic
block, or other region over which the lattice is coherent, gives a strain 
rate in agreement with the experiment, assuming mosaic blocks to be 
10~* cm across. 

SHALER’s™ measurements of the widths of junctions between
particles should, when plotted against the sintering time, give a straight
line through the origin if FRENKEL’s theory were correct. In fact the
curves bend steeply towards the time axis during the first hours of
sintering and, although they may become linear over long times (over
20 hours), the linear parts do not extrapolate to the origin. SHALER
thought the form of these curves to be evidence for two mechanisms;
he suggested an initial rapid flow by creep or slip, with only the slower
viscous flow operating at longer times.

CLARK and WHITE™ have examined the possibility of viscous flow
of a layer of material of thickness h to form a lens between two spherical
particles. Their model is shown in Fig. 6. It is assumed that these
particles remain spherical and that flow or diffusion maintains this form
while matter transfers to the junctions and the volumes of the particles
decrease. Clearly this can be true only of the early part of sintering and
even then this is not the picture that metallography has given of the
changes in metal compacts. CLARK and WHITE made experiments with
refractory oxides which perhaps come nearer to their model.

Surface tension (γ) will impose a positive hydrostatic pressure on
material in the spherical particles and, because of their concave surface,
a negative hydrostatic pressure within a lens. The pressure difference
acting from sphere to lens will be

$$P = \frac{\gamma}{R}\left\{2 + \frac{x}{1-x} - \frac{1}{\sqrt{1-x^{2}}}\right\}$$

It is supposed that flow under this pressure is in shear and occurs
viscously by activated jumps of moving units. It is assumed that this
flow occurs primarily in a thin surface film of the almost cylindrical
lens and that most of the pressure drop at the slow rate of flow occurring
will be over a short length l on either side of the plane AA (Fig. 6) to
which all the moving material must converge.
Flow into one lens of volume v can then be written as


dv “ aPr* [4 (h\? 
—=2 2er.u,.dr= — (—) . 
dt To nl 3 r; 


. 


Here u_r is the velocity of flow where the radius has the value r, and
r_0 is the radius of the stationary part of the lens, that is
r_0 = r_l − h. From the equation CLARK and WHITE obtain the change of
volume shrinkage V with time as


. nV 1— x x 
fj 2+ = — a i (4) 
9 l—z V1l—z* 


where V is the total volume of the lenses in 1 gram of specimen, q the
initial pore volume in 1 gram, n the number of contacts with neighbours
made by each particle and B = γh³/ρηlR³. With varying temperature
the most significant term is η which, for an activated flow process,
varies so that B ∝ e^(−E/RT), where E is an energy of activation for the
flow process.

CLARK and WHITE tested their ideas about a sintering mechanism
by obtaining experimental shrinkage-time curves and comparing them
with calculated ones.

So far the assumption has been latent in this model that lenses remain
distinct from each other. In fact they will merge at some value of x
which will depend upon the packing arrangement of the particles; if
each particle had eight neighbours this will be when x is about 0.8.
Actually CLARK and WHITE found that curves derived from their
equations agreed with experimental data when taken beyond this
expected limit to the range of their applicability.

In this way CLARK and WHITE obtained curves giving good fit with
experiments made on a glass at temperatures of 598°, 626°, 648° and
676°C. This material had a density ρ of 2.54 g/cm³; q, the pore volume
per gram, was 0.21 cm³; the surface tension γ was about 400 dynes/cm
and at 676°C the viscosity was about 10’ poises; n was assumed to be 8.
Experimental results obtained with alumina or with beryllium oxide did
not agree with curves calculated on this basis of simple viscous flow.
For these materials a Bingham type of flow with a critical shear stress
τ_c was considered; this gave rates of strain proportional to the stress
due to surface tensions when reduced by subtracting the quantity τ_c.
This gave a modified shrinkage curve


dl 


p+ nv i 
dt 


yj 2 


BI; 


| 3R,/r,) l ™ 
_ : (5) 


Vp (l + Vp)? 
the term in curly brackets being a dimensionless number. 

Curves were calculated from this which agreed well with experiments
made on α- and γ-alumina and a beryllium oxide.

The quantity B = γh³/ρηlR³ varies with temperature and in this
the change in η is the dominant factor. This varies with temperature
according to the Arrhenius activation energy relationship
B ∝ e^(−E/RT), where E is the activation energy for the flow process. It
follows that log B should give a straight line when plotted against 1/T.
For alumina CLARK and WHITE found this true and it also held roughly
for magnesia. For glass, however, the relationship obtained gave a
curve which implied a rise in the activation energy with falling
temperature. The activation energies for viscous flow of glass do
vary in this way. Values of the activation energy obtained were
87,000 cal/mole for alumina and, most probably, 43,000 cal/mole for
magnesia.
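
The straight-line test described above can be sketched numerically. The
following is a minimal illustration, not CLARK and WHITE's own procedure:
it fits ln B against 1/T by least squares and reads the activation energy
from the slope. The temperatures, the pre-exponential factor and the
function name are hypothetical; only the form B ∝ e^(−E/RT) and the
alumina value of 87,000 cal/mole come from the text.

```python
import math

R_GAS = 1.987  # gas constant, cal/(mole K)


def fit_activation_energy(temps_c, b_values):
    """Least-squares fit of ln B = ln B0 - E/(R T); returns (E, B0)."""
    xs = [1.0 / (t + 273.15) for t in temps_c]   # 1/T with T in kelvin
    ys = [math.log(b) for b in b_values]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx                  # equals -E/R for Arrhenius behaviour
    intercept = y_mean - slope * x_mean
    return -slope * R_GAS, math.exp(intercept)


# Synthetic check: build B values from a known energy and recover it.
E_TRUE = 87_000.0                             # cal/mole, alumina value in the text
temps = [1300.0, 1400.0, 1500.0, 1600.0]      # hypothetical temperatures, deg C
b_syn = [2.0e3 * math.exp(-E_TRUE / (R_GAS * (t + 273.15))) for t in temps]
E_fit, B0_fit = fit_activation_energy(temps, b_syn)
print(round(E_fit), B0_fit)
```

A curved plot, such as the one reported for glass, would show up here as
a slope that changes when the fit is restricted to different temperature
ranges.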

It appears probable from the metallographic work on the “sintering”
together of wires, or that of spherical particles to planes, that the
particle radii all vary little throughout a large part of the changes during
which contact radii grow. This seems generally true until, in a powder,
the pores would be sealed off, and this commonly occurs when the
porosity is about 15 per cent of the total volume. In deriving their
shrinkage-time relationships CLARK and WHITE assumed the particle
radii to decrease uniformly. Their further assumption that the
thickness, h, of the moving layer remains constant throughout sintering
would appear open to question in the cases of solids giving Bingham
flow. The assumption would imply that as the lens grows the thickness
over which stress exceeds the yield value is not dependent upon the
radii of the lens, r_l and r_s.

MACKENZIE and SHUTTLEWORTH®’ have considered viscous or plastic
flow of a mass containing uniformly distributed sealed-off spherical
pores. Surface tension on the pores, all of radius X, is considered
equivalent to an external pressure of +2γ/X applied to a solid which is
assumed to be incompressible, so that the effect is uniform through any
size or shape of sintering material. The effective pressure in copper
would be that of an applied load of one atmosphere. An equation of
state, that is an equation linking strain rate with stress at constant
temperatures, was sought, and again viscous flow and Bingham flow
were examined. In this work no atomic mechanism was considered, but
by equating energy from reduction of surface area with energy
dissipated in flow, equations for shrinkage with sintering time were again
developed.

MACKENZIE and SHUTTLEWORTH do, however, argue that flow must
arise from a macroscopic mechanism in which atoms move in assemblies
and not from any process in which they move as individuals, which,
they say, would inevitably be too slow to explain the observed rates. To
illustrate this they show that if the excess concentration of vacant
lattice sites in copper at 1000°C is that due to pores of 10~* cm radius
in a specimen giving an average distance of 10-' cm from the surface,
then shrinkage due to volume diffusion would be at a rate of about
10-*/hour. If sintering were by volume diffusion, however, it should
occur from the surface inwards as diffusion of vacant sites to the
surface would not occur from any pore while other pores screened
the surface from it. While such an effect has been recorded™ there
is not sufficient evidence to show that it is a general phenomenon
of sintering; plastic mounting of a specimen for metallographic
examination is itself able to develop what appears as this effect.

The first step in this derivation of shrinkage-time relationships is to
consider the shrinkage of one pore of radius X surrounded by a spherical
shell (radius X,) of non-porous matter, the whole placed in a mass of
uniform matter (that is, the rest of the sintering specimen). The stress
in this mass being everywhere a continuous hydrostatic pressure and the
overall density of this pore with its shell being that of the whole mass,
the radial strain rate is

where u is the radial velocity at the surface of the pore. Derivation of
this takes from the theory of viscosity the relationship
dG/dt = 3(du/dX)²η, where G is the energy. If there were only one isolated
pore in a specimen the outer radius of the shell would be infinite. By
putting a limiting thickness to the shell, that is giving X, a finite value,
the interaction of other pores with one under consideration is taken into
account. This treatment is strict for small values of X,/X,, but it is
not clear how accurate it is when this quantity is large, that is when
the pores are close together. This means that MACKENZIE and
SHUTTLEWORTH’s expressions should hold more rigidly during the later
stage of sintering.

A solid with Newtonian viscosity has strain rate proportional to
stress and viscosity η independent of strain, so that the radial velocity
is u = γ/2ηρ. For an isolated pore ρ = 1 and the factor 1/ρ
represents the effect of all other pores on the rate of sintering of one
pore. In a powder compact ρ is generally more than 0.5, so that a pore
in a compact will not close up at rates greater than twice that of an
isolated pore.

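The rate of density change of a Newtonian solid with time during an
isothermal sintering run is given in terms of the number of pores, m,
in a unit volume, the surface tension and viscosity by:

$$\frac{\gamma m^{1/3}}{\eta}\,(t - t_0) = \frac{2}{3}\left(\frac{3}{4\pi}\right)^{1/3}\int_0^{\rho}\frac{d\rho}{(1-\rho)^{2/3}\,\rho^{1/3}} \tag{6}$$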
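
As an illustration of how the Newtonian case can be evaluated, the sketch
below computes the reduced time γm^(1/3)(t − t_0)/η needed to reach a
given relative density ρ. It assumes the Newtonian rate law
dρ/dt = (3/2)(4π/3)^(1/3)(γm^(1/3)/η)(1 − ρ)^(2/3)ρ^(1/3), which is the
form reconstructed here for MACKENZIE and SHUTTLEWORTH's result, and
removes the singularity at ρ = 0 by the substitution ρ = s³ before
applying Simpson's rule. The function name and step count are
illustrative choices.

```python
import math

# 1 / [(3/2) (4 pi / 3)^(1/3)], the constant in front of the integral
PREFACTOR = (2.0 / 3.0) * (3.0 / (4.0 * math.pi)) ** (1.0 / 3.0)


def reduced_time(rho, steps=2000):
    """Reduced time gamma * m**(1/3) * (t - t0) / eta to reach density rho < 1."""
    if not 0.0 <= rho < 1.0:
        raise ValueError("rho must lie in [0, 1)")
    if steps % 2:
        steps += 1                       # Simpson's rule needs an even count
    upper = rho ** (1.0 / 3.0)           # substitution rho' = s**3
    h = upper / steps

    def g(s):
        # integrand after substitution: 3 s / (1 - s**3)**(2/3)
        return 3.0 * s / (1.0 - s ** 3) ** (2.0 / 3.0)

    total = g(0.0) + g(upper)
    for i in range(1, steps):
        total += (4.0 if i % 2 else 2.0) * g(i * h)
    return PREFACTOR * total * h / 3.0


for rho in (0.6, 0.8, 0.95):
    print(rho, reduced_time(rho))
```

Multiplying the reduced time by η/(γm^(1/3)) converts it to a real
sintering time once values for the viscosity, surface tension and pore
density are chosen.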

For a Bingham solid of critical shear stress τ_c the viscosity at any
moment is dependent on the strain, and the shrinkage of a pore,
− dX, /dt (radial velocity of pore surface inwards) was found to be


iy f _8y2rXs | xy 
2" vp 2y XxX, 


where η_0 is the ultimate value to which η tends.
With such materials a pore will not close unless 


this means that there is a critical size of pore 
l—p 
such that material containing only larger pores will not increase in 
density. 
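The rate of increase in density is given by the equation:

$$\frac{d\rho}{dt} = \frac{3}{2}\left(\frac{4\pi}{3}\right)^{1/3}\frac{\gamma m^{1/3}}{\eta_0}\,(1-\rho)^{2/3}\rho^{1/3}\left[1 - \alpha\left(\frac{1}{\rho}-1\right)^{1/3}\ln\frac{1}{1-\rho}\right] \tag{7}$$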


where $\alpha = \sqrt{2}\,(3/4\pi)^{1/3}\,\tau_c / (2\gamma m^{1/3})$.

From this equation values of density can be calculated and plotted
for any selected value of α against a reduced time γm^(1/3)(t − t_0)/η_0.
It is clear from the equation defining α that this quantity is zero for the
case of Newtonian viscous flow, that is when τ_c is zero, and that it
increases with τ_c; this quantity α also depends upon the number of
pores per unit volume, so that dρ/dt depends upon this quantity. From
equation (7) dρ/dt can be seen to be positive, that is, sintering with
shrinkage occurs, only when the expression in the square bracket is
positive. This is generally true if α is small, but when α > 0.89 the
value of dρ/dt falls to zero as the square bracket term itself falls to
zero, that is when (1/ρ − 1)^(1/3) ln[1/(1 − ρ)] = 1/α. If pressing before
sintering had given a density greater than 0.94 of that of the massive
material sintering could go on to the theoretical density.


The term (1/ρ − 1)^(1/3) ln[1/(1 − ρ)] tends rapidly to zero as ρ tends
to unity, which means that when pores are very small or very far apart
they will cease to close.
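
The two numbers quoted above, α = 0.89 and a density of 0.94, can be
checked with a short calculation. The sketch below assumes that the
square-bracket term of equation (7) has the form
1 − α(1/ρ − 1)^(1/3) ln[1/(1 − ρ)]; it locates the maximum of the
density-dependent factor by a golden-section search and takes the
critical α as the reciprocal of that maximum. The function names are
illustrative.

```python
import math


def bracket_term(rho):
    """(1/rho - 1)**(1/3) * ln(1/(1 - rho)) for 0 < rho < 1."""
    return (1.0 / rho - 1.0) ** (1.0 / 3.0) * math.log(1.0 / (1.0 - rho))


def maximise(func, lo, hi, tol=1e-10):
    """Golden-section search for the maximum of a unimodal function."""
    inv_phi = (math.sqrt(5.0) - 1.0) / 2.0
    a, b = lo, hi
    c = b - inv_phi * (b - a)
    d = a + inv_phi * (b - a)
    while b - a > tol:
        if func(c) > func(d):
            b, d = d, c                  # maximum lies in [a, d]
            c = b - inv_phi * (b - a)
        else:
            a, c = c, d                  # maximum lies in [c, b]
            d = a + inv_phi * (b - a)
    return (a + b) / 2.0


rho_star = maximise(bracket_term, 0.5, 0.999)
alpha_crit = 1.0 / bracket_term(rho_star)
print(rho_star, alpha_crit)
```

Below the critical α the bracket stays positive at every density; above
it, densification stalls before ρ reaches the maximum point unless the
compact starts at a higher density.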

MACKENZIE and SHUTTLEWORTH calculated from strain experiments
on copper wires made by UDIN, SHALER and WULFF’ that the critical
shear stress of copper is greater than 2.0 x 10⁵ dynes/cm² at 1000°C, and
from sintering experiments” that it is less than 2.6 x 10° dynes/cm²
at 850°C. At room temperature the value is 10’ dynes/cm².

KUCZYNSKI®™ examined theoretically what the rate of growth of the
contact between a sphere and a plane surface on heating should be in
the cases of mechanisms based on viscous or plastic flow, on volume
diffusion, on surface diffusion and on evaporation and condensation.
He found by his experiments what power of the contact radius was
proportional to “sintering” time and compared these results with
predictions obtained from his considerations of mechanisms.

For an evaporation and condensation mechanism he takes the pres- 
sures of vapour p, and p, over surfaces of radii R and r, as given by the 
Kelvin equation, 

Pe — 2ya* ] 
"Dp, &T | 
a being the interatomic distance in the solid. The rate of movement of 
matter at one temperature should be proportional to the pressure 
difference (p, — p,), and as R is effectively constant, p, does not alter, 
and so the rate of movement of matter is inversely proportional to r_s,
that is the rate of movement M = k'/r_s, where k' depends on
temperature and R. Expressing r_s in terms of r_l, the quantity M was
shown to be proportional to 1/r_l², where M is now the rate of
condensation on the junction of area πr_l³/R. Putting M ∝ (R/r_l³)(dv/dt),
KUCZYNSKI found that the cube of the junction radius is proportional to
the time, r_l³ ∝ t.
Volume diffusion was considered in terms of the movement of vacant 
lattice sites. Near the junction the radius of curvature is small; an 
increase in volume here would reduce surface energy and could 
occur if the concentration of vacant sites increased. This concentration 
depends upon the curvature of nearby surfaces so that there will 
be a concentration gradient of vacant sites between the junction 
“lenses” and the rest of the sintering particles and vacant sites
will diffuse away from the junctions or atoms will come into 


them. Kelvin’s relationship may be written as AC = pe wal 

where C is the concentration of vacant sites and Z’ is the energy 

required to produce one vacancy. If D, is the coefficient of volume 

diffusion, that of diffusion of holes is D’ = D,e*'** and Fick’s equation 
ar? AC i - 

is —— . — = -——; combining these gave 


R rs : dt 









(8) 







The case for surface diffusion has been examined by CABRERA™ and
SCHWED” as well as by KUCZYNSKI. They have reached different
relationships between junction radius and time. The equilibrium
concentration of adsorbed atoms or of vacant sites at any point will
depend upon the curvature of the surface there. There will thus be
different concentrations at the junction and at surfaces of larger radii
and this will be such that atoms flow to the junction. The equilibrium
concentrations of adsorbed atoms or vacant sites on any surface may be
kept up by condensation from vapour or from growth dislocations
in the crystal surfaces.

Fig. 7. Section through a spherical particle sintered to a plane surface as
in KUCZYNSKI’s experiments*'

CABRERA considered the surface as of two parts with uniform
concentrations of adsorbed atoms over each and a discontinuity at the
junction. According to SCHWED, KUCZYNSKI considered only a
vacancy concentration gradient in the neck. SCHWED argued that
these models are not adequate. Setting up an equation for diffusion past
the junction A in Fig. 7, and avoiding these restrictions but assuming
the flow to be balanced at any point by interchange of atoms between
normal surface positions and sites which steps in the surface enfold
where they bend, he finds that for small values of the radius r_l the
growth law is r_l⁸ ∝ t; where r_l is larger, the relation is r_l⁵ ∝ t. Here r_l
is to be considered small or large if it is much smaller or greater than 
a distance (D,t,)'*/7, where t, is the relaxation time for condensation 
of atoms on the surface. It is not easy to estimate this distance with 
any certainty. Putting D, = 10-* cm?/sec and r, = 10-*cm would 
imply a relaxation time of 10-* seconds as standing between the two 
sintering laws. 

KUCZYNSKI originally proposed a law r_l⁷ ∝ t for surface diffusion
and CABRERA, amending the argument, gave r_l⁵ ∝ t, which would not
allow distinction between surface and volume diffusion.

These predicted relationships, all of the form (r_l)^K ∝ t, may be
summarized as a table.

Sintering Mechanism               K
Viscous or plastic flow           2
Evaporation-condensation          3
Volume diffusion                  5
Surface diffusion (SCHWED)        8 if r_l is small
                                  5 if r_l is large

According to SCHWED, atom or site replacement by equilibrium
reaction alone gives a flow which leads to no net junction growth as
atoms arrive or leave at equal rates by this means.

KUCZYNSKI examined experimentally the behaviour of copper,
silver® and glass.*° He sintered round particles on to flat surfaces and
then sectioned these and measured the radii of the junctions. This
must be a very difficult experiment. One of these specimens is shown in
Fig. 4. For particles of copper (4 x 10-*cm radius) or of silver
(2 x 10-* cm radius) he obtained values of log (r_l/R), which gave linear
plots against the logarithm of time, t; the slopes varied between 4.6
and 5.4. This corresponds with his deduced value of 5 for a volume
diffusion mechanism.

With very much smaller copper particles (5 x 10-*cm diameter)
KUCZYNSKI’s experiments gave a value of K of about 6.5, which lies
closer to his prediction for a surface diffusion mechanism, but in the
light of CABRERA and SCHWED’s work this result is not easy to explain,
and in fact the result for very small copper particles is inconsistent with
the theory.

In his experiments with a glass KUCZYNSKI® used beads of c. 2.5 x
10⁻² cm radius and obtained excellent straight-line plots of (r_l/R)²
against t. (The data tabulated in his paper contain some misprints.)
This is his relationship for a viscous flow mechanism of sintering.
KUCZYNSKI’s work is thus a very vivid indication that sintering can be
dominated by quite different processes and that the nature of the
material is an important factor in determining which are important.
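
The comparison of a measured exponent with the predicted ones can be
sketched as follows. This is an illustration only, with synthetic
junction radii rather than KUCZYNSKI's data: the exponent K in
r_l^K ∝ t is taken as the least-squares slope of log t against log r_l
and matched to the nearest predicted value. The exponents 3, 5 and 7 are
those quoted in the text; the value 2 for viscous flow is assumed here,
and the function and dictionary names are hypothetical.

```python
import math

# Predicted exponent K in r**K proportional to t for each mechanism.
PREDICTED_K = {
    "viscous or plastic flow": 2,        # assumed value, see lead-in
    "evaporation-condensation": 3,
    "volume diffusion": 5,
    "surface diffusion (Kuczynski's original law)": 7,
}


def growth_exponent(times, radii):
    """Least-squares slope of log t against log r, i.e. the exponent K."""
    xs = [math.log(r) for r in radii]
    ys = [math.log(t) for t in times]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return sxy / sxx


def closest_mechanism(k):
    """Name of the mechanism whose predicted exponent lies nearest to k."""
    return min(PREDICTED_K, key=lambda name: abs(PREDICTED_K[name] - k))


# Synthetic data obeying r**5 = c * t (hypothetical constant c).
times_h = [1.0, 2.0, 4.0, 8.0, 16.0]
radii_cm = [(1.0e-15 * t) ** 0.2 for t in times_h]
k_fit = growth_exponent(times_h, radii_cm)
print(k_fit, closest_mechanism(k_fit))
```

With real measurements the scatter in K matters: slopes between 4.6 and
5.4 all map to volume diffusion, while a value near 6.5 falls closer to
the surface diffusion prediction.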
MACKENZIE and SHUTTLEWORTH suggest that in KUCZYNSKI’s
experiments shear stress will be zero except just at the interface and
consequently any plastic movement of the bulk of the material is
impossible. The elimination of plastic or viscous flow would make it
possible for some other, and perhaps normally subsidiary, mechanism
to dominate the process.

There are very few data on the mechanical properties of metals under
small stresses at sintering temperatures. ALEXANDER, DAWSON and
KLING’ have obtained creep curves on 0.0028 cm diameter gold wires
under small stresses at 920° to 1020°C. At such low stresses the
minimum strain (creep) rate was proportional to the stress; thus at
1020°C strain rate was linearly related to the stress up to about
10-* dynes/cm², at which stress the strain rate was about 8 x 10~* per
second. This result means that the flow observed under these conditions
was viscous; if viscous flow is possible under such minute stresses it is
surprising that KUCZYNSKI’s experiments suggested diffusion as the
mechanism of sintering under small stresses. The explanation has been
put forward‘ that in sintering experiments of this type the flow occurs in
the small neck of sharp radius r_s = 10~* cm, which is about the
dimension of a mosaic block and that, as KUCZYNSKI’s experiments showed,
this flow is by volume diffusion. In an aggregate of mosaic blocks, such
as the gold wires, the creep experiment suggests that volume diffusion of
vacancies within the blocks could lead to viscous flow of the specimens.
Thus the same mechanism could appear as a diffusion process in
sintering and as a viscous flow of the wires.

MURRAY and WILLIAMS®* have considered the application of the
MACKENZIE and SHUTTLEWORTH equation, (7), to hot pressing, that is
when a pressure is applied at sintering temperature to a mass containing
closed pores. This pressure would require a term to be added to the
pressure due to surface tensions. They believe that for hot pressed
metals (dρ/dt) at constant ρ and T is proportional to L^U, where L is
pressure and U has a value normally between 3 and 2, but which falls
as T rises. This power relationship could arise if a creep process were
involved in sintering, creep rate being proportional to a power of the
stress causing it. Pressure is not known to influence diffusion constants
and so should not affect sintering due to diffusion. However, pressure
alone will compact a metal and its effect would be added to that of any
other process.

Diffusion appears to be much faster along grain boundaries than
through grains, and as there is no indication that traces of impurity
affect bulk diffusion markedly, the difference cannot be due to
congregation at boundaries of foreign atoms. HOFFMAN and TURNBULL™
diffused radioactive silver into the normal metal and by then taking
auto-radiographs showed the active material to be concentrated at
grain boundaries; its concentration fell away with depth in the silver.
In a rather similar study of diffusion of active silver into copper
ACHTER and SMOLUCHOWSKI' found the rate of penetration to increase
as the difference of orientation between the grains on either side of
the boundary increased. Values of lattice (D_l), boundary (D_b) and
surface (D_s) diffusion coefficients for the diffusion of silver and thorium
into tungsten are collected into the table:

D = D_0 exp(−E/RT)

                            D_l         D_b         D_s
Silver into tungsten:
  D_0 (cm²/s)               0.895       0.025       0.16
  E (cal/g atom)            45,950      20,200      10,300
Thorium into tungsten:
  D_0 (cm²/s)               1.0         0.74        0.47
  E (cal/g atom)            120,000     90,000      66,400

The width of a band of silver through which silver can pass by volume 
diffusion at the same rate as it does by boundary diffusion along a 
boundary is: 


Temperature 400 600 
Width . 5 f 10-2 


This width must be the minimum grain diameter if bulk (lattice) 
diffusion is to dominate the diffusion transport of material. 

GREENOUGH™ studied the extension of silver wires under small
stresses. His measured strain rate of −7.61 x 10-* per second at
920°C agrees with that of −2.66 x 10-* per second calculated by
HERRING. This calculation assumes that mosaic boundaries cannot
function as sinks for diffusion currents and that diffusional viscosity
can only account for creep at very high temperatures and low stresses.
HERRING supposes that these conditions apply in sintering and that
grains may slide over each other tangentially as they grow normally to
their boundaries.

In single crystal specimens of silver GREENOUGH™ found no
measurable strain, which indicates that plastic flow within grains makes no
contribution to creep of the wires. His results for crystals 0.1 cm
diameter tested for 2824 hours at 910°C were:

Specimen              Stress (dynes/cm²)      Strain
Single Crystal 1      131,000                 +0.005
Single Crystal 2      114,000                 +0.001
Polycrystalline       126,000                 +0.058
HERRING™ has examined the effect of the linear dimensions of a
sintering system for different sintering mechanisms. Thus, if two
compacts contain particles, in one case of radius R_1, in the other of
R_2 where R_2 = λR_1, and if after intervals Δt_1 and Δt_2 the contacts
have radii (r_l)_1 and (r_l)_2, where (r_l)_2 equals λ(r_l)_1, HERRING
sought to relate the times Δt_1, Δt_2 with the mechanisms of sintering.

When considering a viscous flow process in a crystalline material it is
not correct to take the expression P = 2γ/R as the pressure inside a
particle of radius R, for the stress will vary over the surface and
shearing stresses will occur in the interior. For viscous flow HERRING
finds Δt_2 = λΔt_1. For evaporation and condensation when sintering
particles of radius 10-* cm or more Δt_2 = λ²Δt_1. Volume diffusion would
give Δt_2 = λ³Δt_1. With surface diffusion Δt_2 = λ⁴Δt_1, if it is assumed
that rate of migration is proportional to the gradient of a chemical
potential. Experiments have not yet been made on this line. Great care
will be required to avoid surface contamination of a specimen and to see
that one crystal orientation does not dominate one sample.
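
HERRING's scaling relations lend themselves to a simple numerical
statement. The sketch below, with hypothetical function names, gives the
time needed by the larger system for each mechanism and, conversely,
estimates the power of λ from a pair of measured times, which is the
kind of experiment the text says has not yet been made.

```python
import math

# Power p in dt_2 = scale**p * dt_1 for each mechanism, as listed in the text.
HERRING_POWER = {
    "viscous flow": 1,
    "evaporation-condensation": 2,
    "volume diffusion": 3,
    "surface diffusion": 4,
}


def scaled_time(dt_1, scale, mechanism):
    """Time for the system enlarged by `scale` to reach the similar state."""
    return dt_1 * scale ** HERRING_POWER[mechanism]


def infer_power(dt_1, dt_2, scale):
    """Power of the scale factor implied by two measured times."""
    return math.log(dt_2 / dt_1) / math.log(scale)


# Doubling the particle radius with volume diffusion controlling:
print(scaled_time(1.0, 2.0, "volume diffusion"))
print(infer_power(1.0, 8.0, 2.0))
```

An inferred power that lies between two integers would suggest that more
than one mechanism contributes over the interval measured.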

To explain the increase in average pore size of pressed copper
specimens which they observed on sintering RHINES, BIRCHENALL and
HUGHES® consider volume diffusion. They found that the concentration
of small pores decreased sharply with sintering time (Fig. 3) indicating
that the shrinkage of small pores is very high; on the other hand, the
largest pores, those of volume more than 10-* cm, increased in volume.
There is nothing in any slip mechanism of sintering to explain the
growth of pores. The authors suppose it to be due to volume diffusion,
vacant lattice site concentrations being higher in the vicinity of small
pores than of larger ones, so setting up concentration gradients and
diffusion. This implies that rapid maintenance of equilibrium
concentrations occurs by the development or condensation of vacancies.
Diffusion of this type between pores and the external surface of a
specimen is suggested as the cause of overall shrinkage.

By assuming that the thermal energy of a crystal may be assigned to
sites according to the Boltzmann distribution one may calculate the
fraction, f, of sites which at any temperature have more energy than that
required to form a vacant site. This energy was taken as 39,700 cal/g
atom. This calculation gives the following values of f:


Temperature (°C ; 1000 
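
The tabulated values of f have not survived in this copy, but the
calculation described can be sketched under a stated assumption: that
the fraction of sites with more than the vacancy energy is the simple
Boltzmann factor f = exp(−E/RT). The energy of 39,700 cal/g atom is the
value given in the text; the function name and the temperatures other
than 1000°C are illustrative, and the authors' exact expression may
differ.

```python
import math

R_GAS = 1.987          # gas constant, cal/(g atom K)
E_VACANCY = 39_700.0   # energy to form a vacant site, cal/g atom (from the text)


def vacant_fraction(temp_c, energy=E_VACANCY):
    """Boltzmann factor exp(-E/RT) at a temperature given in deg C."""
    return math.exp(-energy / (R_GAS * (temp_c + 273.15)))


for temp_c in (800.0, 900.0, 1000.0):
    print(temp_c, vacant_fraction(temp_c))
```

On this assumption the fraction at 1000°C is of the order of 10⁻⁷ and
falls steeply as the temperature is lowered.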


The Kelvin equation is taken as giving concentrations of vacant sites
near curved surfaces. Using these values, RHINES, BIRCHENALL and
HUGHES estimate the size of pore which will neither grow nor shrink in
a rather idealized model. In this each pore is supposed to have nine
neighbours; it gains vacant sites from smaller pores and loses them to
larger ones or to the free surfaces. The proportion of sites leaving one
size of pore which reaches pores of any other size depends upon their
area of cross section.

The results agree quite well with observation for long sintering times 
at high temperatures. It is suggested that quite large pores will never 
be removed by heat alone. Another effect forecast is that a non- 
wetted inclusion above a minimum size ought to provide a surface for 
vacant sites to discharge at, and so come to occupy a pore larger than 
itself. The shape and size of a specimen should also affect sintering 
rate; it seems improbable that this is so. 

RHINES, BIRCHENALL and HUGHES think that evaporation and
condensation could provide a rapid enough change of pore shape; at
1000°C they estimate a rate of condensation at sharp junction angles
of about 10-" g/sec; in a typical case of sintering about 10-*g of
material would flow into the junction during the rounding of pores.

It becomes gradually more clear that under properly controlled 
circumstances the mechanism of sintering may be dominated by one or 
other of different processes, particularly by viscous flow or by diffusion. 
There is still uncertainty about what factors are most important in 
selecting a mechanism, and so this cannot yet be set up from first 
principles and conditions for sintering then forecast. Moreover, many 
simple points of the course of sintering are yet unsettled; whether, for 
instance, pores do in general become spherical rapidly or only do so 
much more slowly than the shrinkage changes. The general course 
of sintering is still uncertain; it is known that much porosity is 
eliminated before pores are isolated, that small pores are removed 
before larger ones which may even grow; that the rounding of 
pores to spherical shape is of secondary importance; that grain 
boundaries running into pores may be important in their shrinkage ; 
that conditions which favour vacant site formation make sintering 
quicker but that transformation temperatures are themselves of small
importance; it is not clear whether porosity can be wholly removed by
sintering alone. It would be valuable to have such points decided more
certainly.

The author is indebted to his colleague Mr. F. O. JONES for frequent
valuable discussions in connection with sintering: he wishes to thank
Dr. T. E. ALLIBONE, F.R.S., for permission to publish this article.


NOTATION 


A  Point of junction between the original surface of a sintering particle and
   of the surface of material which has moved during sintering (Figs. 6
   and 7).
a  Interatomic distance.

B  Proportionality constant in equation (4).
C  Concentration of vacant lattice sites.

D_v, D_s and D_b  Diffusion coefficients for volume, surface and grain-boundary
   diffusions.

D_0  Value of a diffusion coefficient at 0° Abs.

D′  Diffusion coefficient for lattice holes.


E  Activation energy.

E′  Energy to produce a vacant lattice site.
f  Fraction of lattice sites which are vacant.
G Energy. 

h Thickness of a flowing layer of matter. 


K Power of the junction lens radius, r,, which should be proportional to 
sintering time, ¢, for any given mechanism. 


k  Boltzmann constant.
L  Compacting pressure.
l  Short length at the junction between particles over which the drop in
   internal pressure occurs.

M  Rate of movement of matter.

m  Number of pores per unit volume.

N  Number of grains per unit length of a wire in which each grain fills the
   whole cross section of the wire.

n  Number of contacts made by a particle with its neighbours.

P  Pressure inside sintering particles.

P  Gas pressure.

P,, Pp,  Vapour pressures over surfaces of different radii.

q  Total pore volume per gramme of material.

R  Particle or wire radius.

R  Gas constant.

r  Radius.
r_l  Lens radius (Figs. 6 and 7).
r_s  Radius of the surface of a lens (Figs. 6 and 7).
T Absolute temperature. 
t Time. 
t_s  Relaxation time for condensation of atoms on a surface.
U  Index for hot-pressing relationship.
u  Radial velocity at the surface of a pore.
V  Lens volume in one gram of material.
~  Shrinkage in volume when one gram of material is sintered.
v  Volume of a single Clark and White ‘lens.’
W  Weight under which a loaded wire neither extends nor contracts when
   heated.
w  Weight.
X  Radius of a pore.
x  cos θ.

α  The expression $\sqrt{2}\,(3/4\pi)^{1/3}\,\tau_c / (2\gamma m^{1/3})$ in equation (7).

p  Densification parameter.

γ  Surface energy.
γ_b  Grain boundary energy.
δ  Height of a lens (Fig. 7).
η  Coefficient of viscosity.
η_0  Ultimate coefficient of viscosity of a Bingham solid.
θ  Half the angle subtended at the centre of a particle by the junction with
   another particle.
λ  Herring’s scale factor.

ρ  Density.

τ_c  Critical shear stress of a Bingham solid.
, , ¢,  Angles.
5
THEORY OF DISLOCATIONS 


A. H. Cottrell 


INTRODUCTION 


THE fact that a second review of progress has become necessary, so 
soon after the first? (which will be referred to as T.D.1), shows that the 
theory of dislocations is being opened up rapidly. Some of this progress 
could have been anticipated from the ideas that were taking shape at the 
time the first review was being written but an important contribution 
has come from newer ideas, some of which have caused a radical change 
in outlook in certain parts of the theory. Judging from present-day 
ferment on topics such as the growth of crystals, work hardening, and 
the nature of slip bands, this pace of progress will be held for at least the 
next five years. 

When the first review was written the broad position was as follows. 
Arguments about the impossibility of the simultaneous movement of 
all the atoms in a slip plane and about the low yield strengths of crystals 
had convinced most people that slip takes place by the movement of 
dislocations. Simple types of dislocations, the edge and the screw, had 
been described and the newly discovered phenomenon of polygonization 
had strengthened belief in at least the first of these. The elastic proper- 
ties of dislocations were well understood and a detailed mathematical 
theory of their stress fields, strain energies, flexibility, forces of 
interaction, and mobility in simple lattices, already existed. The dynamical 
properties of dislocations were mainly unknown, although it had been 
recognized that dislocations could not move faster than sound, and that 
near this speed they would have enough kinetic energy to create new 
dislocations from themselves. This dynamic creation of dislocations was 
thought to be the basic process in the growth of slip bands. The source 
of the original dislocations was still a mystery, however. 

Age-hardening had already been discussed in detail from the point 
of view of dislocations and a start had been made towards explaining 
the yield point of iron in terms of the anchoring of dislocations by 
segregated impurity atoms. The theory had also been applied to the 
analysis of the structures of grain boundaries, but not to the strengthen- 
ing effect which they have. Several attacks on the work hardening 
problem had been made, but these had only penetrated far enough to 
show how formidable it really is. So long as it remained unsolved there 
could be little progress, apart from preliminary skirmishes, with the 
closely related problems of recrystallization, creep, fatigue, and 
ductile fracture. 

The most exciting of the new developments has been in crystal growth. 
Difficulties in the classical theory led to the idea that the growth of real 
crystals is catalysed at places where dislocation lines meet their surfaces. 
Many crystals contain dislocations because they could not have grown 
otherwise. Spectacular evidence supporting these ideas has come with 
the actual observation of the “growth spirals” predicted by the theory; 
it is now possible to locate, and even “see,” dislocations in certain 
crystals. 

An important new idea is the “Frank–Read source.” This shows that 
certain geometrical arrangements of dislocation lines are able to create 
from themselves large numbers of new dislocations by a process not 
involving kinetic energy. This idea of static creation is not only oppor- 
tune, because doubts have been raised about whether dislocations 
can move fast enough for dynamic creation; it is also something long 
needed for understanding slip bands, deformation twins, and work 
hardening. Another important idea is the creation and absorption of 
vacant atomic sites at dislocations. Things that may depend on this 
are the Kirkendall effect in diffusion, polygonization, work hardening, 
and annealing. 

Various advances have been made in the elastic theory of dislocations, 
including the distribution of dislocations in slip bands, the energies of 
small-angle crystal boundaries, the effects of interatomic forces on the 
widths and mobilities of dislocations, and numerous effects concerning 
the interactions of dislocations with each other, with grain boundaries, 
and with other kinds of elastic singularities. The forms of dislocations 
in various crystal lattices have begun to be examined, and this has led 
to the recognition of various “imperfect” dislocations that line the 
boundaries of sheets of stacking fault in the lattices. These imperfect 
dislocations are believed to be important in determining slip planes, in 
deformation twinning, and in work hardening. 

The theory of the yield point and strain ageing has been elaborated 
and experimental support gained for some of its predictions. Problems 
concerning the obstruction of slip at grain boundaries have been clarified 
and the attack on work hardening has continued. Observations on 
cross-slip, multiple slip, deformation bands, and the fine structure of 
slip bands, have suggested new ways of looking at the work hardening 
problem. In particular, interactions of dislocations other than the 
classical elastic ones have attracted some attention, for example, 
parallel dislocations on intersecting slip planes may sometimes run 
together to form immobile dislocations. Although the work hardening 
problem is not yet solved, or the problems of recrystallization, creep 
etc. that are related to it, several new advances have been made and 
we can at least now say that dislocations seem to have that versatility 
in their properties that the complexities of these processes demand. 

A satisfying recent trend in the theory as a whole is that it is becoming 
more quantitative, i.e. making quantitative predictions that 
can be tested experimentally. Two examples of this are the dependence 
on orientation of the energy of small-angle boundaries, and the time 
law of strain-ageing in iron. 


THE GROWTH OF IMPERFECT CRYSTALS FROM VAPOURS AND 
SOLUTIONS 


The driving force that makes a crystal grow from its vapour is deter- 
mined by the ratio of the pressure of the vapour to the vapour pressure 
of the solid. Growth from a liquid solution similarly depends on the 
degree of supersaturation, while growth from a melt depends on the 
degree of undercooling. 

The process by which a perfect crystal should grow has been analysed 
in detail in the classical theory of GIBBS, VOLMER, BECKER and 
DÖRING, and others (reviewed in references 2, 3, 4). Although the 
lattice is perfect, the crystal may have imperfect faces which consist of 
terraces of steps, each step being the edge of a single sheet of atoms or 
molecules which covers part of the face. Growth can occur easily on 
these imperfect faces, even when the degree of supersaturation is small, 
because atoms can deposit in the re-entrant corners provided by the 
steps and become firmly attached to the crystal by bonding with two or 
more of its atoms. This state of affairs cannot continue indefinitely, 
however. As the steps advance by the repeated attachment of new 
atoms they leave behind themselves smooth close-packed crystal faces 
on which there are no projections against which later atoms could lodge 
themselves. The crystal cannot grow further unless atoms condense on 
these smooth faces and assemble together to make a nucleus for the 
next layer. This is difficult because isolated atoms or small groups of 
them can be thrown off easily by thermal agitation, and the chance of 
a sufficient number of them coming together to make a permanent 
attachment to the crystal is small if the vapour is only mildly 
supersaturated. This conclusion can be verified by mathematical analysis 
of the nucleation process²,³,⁴ and it has been shown in this way that 
perfect crystals could only grow at the rates observed with real crystals 
if the degree of supersaturation exceeded about 50 per cent. 

Most experiments contradict this result. For example, VOLMER 
and SCHULTZE,® working on the growth of inorganic and organic 
crystals from vapours, showed that the rate of growth is practically 
as fast as the rate at which molecules strike the surface from the 
vapour, even when the supersaturation is only about 1 per cent. More- 
over, the rate of growth is directly proportional to the degree of 
supersaturation, whereas the theory predicts a spectacular exponential 
dependence. 

FRANK*: * realized that neither the theory nor the experiments were 
wrong, but that they applied to different crystals. The crystals which 
grow so easily in practice are imperfect. In particular they contain 
dislocations. The importance of dislocations in crystal growth can be 
seen as follows. What is needed for continual and rapid growth is an 
imperfection of the crystal faces which ensures that growth steps are 
always on them, no matter how much they grow. As Frank pointed 
out, this condition is ensured if screw dislocations end at these faces (or 
edge dislocations if these approach the surface obliquely so that there is 
a component of slip standing out of the surface). Fig. 1 shows a 
crystal face on which there is a point of emergence B for a screw 
dislocation AB in the crystal. From the definition of a screw dislocation 
(T.D.1, page 79) the direction of slip is along AB, so that a step BC is 
formed on the face running outwards from the point of emergence. 
Growth can occur by condensation along the step, and a new layer is 
added as the step rotates about its point of emergence. 

Fig. 1. A screw dislocation AB which ends at a point B on a crystal face 

The essential point is that this step never disappears. Because of its 
special topological property of ending in the middle of the face, which is 
a consequence of the geometry of the dislocation, the step is always 
present no matter how much the crystal continues to grow. It does not 
retain the simple form shown in the diagram, of course. Since that 
part of the step near to the point of emergence requires the addition of 
only a few atoms to make one revolution, whereas outlying parts need 
large numbers of atoms, the centre rotates faster than the outside, and 
the step winds up into a “growth spiral,” as in Fig. 2. 

Frank’s theory also considers growth from several points of emer- 
gence on the same face. The rate of growth is, over a wide range, not 
very sensitive to the density of dislocations which emerge on the face. 
An important case is that where two dislocations of opposite signs 
emerge and are joined by a common growth step. This step generates 
not a continuous spiral but a series of concentric growth rings, as in 
Fig. 3. 

Fig. 2. A single growth spiral 

Fig. 3. Stages in the formation of a growth ring from a step joined to two points of emergence, A and B 

Striking confirmation of this theory has been provided by the actual 
observation of growth spirals and rings, showing all the features 
expected from the theory, on the faces of various crystals. Fig. 4 
shows the original example by GRIFFIN‘ on a crystal of beryl. This has 
been the fore-runner of many others observed on various inorganic and 
organic crystals.’ By using electron microscopy, multiple-beam inter- 
ferometry, and other optical methods, it has been proved that the 
steps are equal in height to the lattice constant of the crystal, or are 
small multiples of that height. 

Because of their small lattice constants, and for other reasons, metals 
are not the easiest materials for showing growth steps. However, by 
depositing large molecules on them, the steps can be made visible, and 
this method has been used by FORTY* to show growth spirals in a 
magnesium crystal grown from a vapour. Fig. 5 is an example; the 
height of the step in this case is probably equal to the c axis of the 
crystal. 

FRANK'S theory explains the growth of imperfect crystals, not their 
nucleation, and separate considerations are needed to decide how the 
dislocations originate in the imperfect nuclei. FRANK suggests various 
possibilities for this, e.g. (a) nucleation in regions of high supersaturation, 
(b) plastic buckling of very thin and fragile nuclei, (c) nucleation on an 
imperfect substrate, (d) buckling under large strains created by a 
gradient of composition, and (e) the collapse of sheet-like cavities 
formed by the aggregation of vacant atomic sites. From a practical 
standpoint the origin of these imperfect nuclei is not important; what 
matters is that they grow in preference to perfect ones. 


THE STATIC CREATION OF DISLOCATIONS 

In 1950, at a conference in Pittsburgh, FRANK 
and READ* pointed out that slip bands might 
grow by a dislocation mechanism similar in 
some ways to that for the growth of crystals. The dislocations concerned 
in crystal growth need not lie in slip planes. In fact, as Frank has sug- 
gested, they are likely to form a three-dimensional network structure, as 
shown in Fig. 6. In such cases we can expect dislocation lines which for 
part of their length lie in a slip plane, and then at some point turn out of 
this plane along an intersecting direction, or join other dislocations 
lying in other planes. A simple example of one of these Frank–Read 
sources is shown in Fig. 7. Here an edge dislocation ABC forms the 
internal boundary of a quarter-plane ABCD of atoms. Suppose that 
the dislocation line BC can move freely in its slip plane CEF and that 
the part AB is for some reason unable to move. Then B is an internal 
point of emergence and the dislocation BC must remain attached to it 
as it glides in its slip plane. The dislocation rotates about this point and 
winds up into a rotating spiral, as shown, which produces a translational 
slip of one atom spacing each time it makes one revolution. 

Fig. 7. The formation of a slip band by the rotation of a dislocation line BC about an internal point of emergence B 

Pieces of dislocation with two internal points of emergence, such as 
those in Fig. 6, can generate slip bands by making dislocation rings. 
Fig. 3 illustrates this process if the lines in these diagrams are thought 
of as dislocations. When the scale l of the network structure is small 
compared with dimensions of the crystal most of the Frank–Read 
sources will be of this type. Only those within a distance l from the 
surface will be of the single-ended type shown in Fig. 7. 

The stress needed to start a source working is important. Consider a 
double-ended source, such as those in Fig. 6. The dislocation has a 
line tension (T.D.1, page 117). This causes it to contract in length until 
it is a straight line joining its two points of emergence. Before the 
source can be set in action this line has first to be bent into a loop of 
radius l/2 (Fig. 3). The applied stress σ needed to do this (T.D.1, 
equations 28 and 29) is about 


$$\mu b / l, \tag{1}$$


where μ is the shear modulus and b is the strength of the dislocation, 
i.e. the length of the slip displacement it produces.* Since b is normally 
one atomic spacing this stress is about the yield stress of a single 
crystal when l is 10⁴ atomic spacings. 
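
The size of this starting stress is easy to tabulate. The following is a minimal Python sketch, assuming only the estimate σ ≈ μb/l for a double-ended source; the function name and the sample source lengths are illustrative choices and are not taken from the text.

```python
def source_stress_ratio(l_in_spacings: float, b_in_spacings: float = 1.0) -> float:
    """Return sigma/mu = b/l for a double-ended source of length l.

    Both lengths are measured in atomic spacings. The default b = 1 spacing
    follows the remark that b is normally one atomic spacing.
    """
    if l_in_spacings <= 0:
        raise ValueError("source length must be positive")
    return b_in_spacings / l_in_spacings


if __name__ == "__main__":
    # Illustrative source lengths (assumed values).
    for l in (1e3, 1e4, 1e5):
        print(f"l = {l:.0e} spacings -> sigma/mu = {source_stress_ratio(l):.0e}")
```

A source 10⁴ spacings long therefore starts at a stress of about 10⁻⁴ of the shear modulus, and shorter sources need proportionally larger stresses.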

FISHER”™ has pointed out that single-ended sources of the same length 
begin to work at only one-half of the above stress. This is because the 
other end of such a dislocation is not attached to a fixed point, but to 
some point along the line where the slip plane meets the surface of the 
crystal. Provided that the dislocation meets this line perpendicularly, 
in order to preserve an equilibrium of forces here, this external point of 
emergence is free to move along the line. As a result it is only necessary 
to bend the dislocation into a loop of radius l, not l/2, to set it working. 
FISHER” has used this idea to explain an observation of BROWN and 
HONEYCOMBE" that, on a purely electropolished surface of aluminium, 
the slip appears in the form of innumerable fine markings, instead of 
the much coarser slip bands some 10-*—-10~* cm apart, which are typical 
of strained crystals with mechanically worked surfaces. FISHER argues 
that when the surface is unworked many more Frank—Read sources are 
active at the surface than in the interior, since for a given length of 
source the surface ones become active at only one-half of the stress 
needed for the interior ones. On the other hand, when the surface has 
been mechanically worked these surface sources are less active because 
of the cold-working strains there. 

* To conform to recent practice, we now use μ and b for those quantities symbolized 
by G and λ, respectively, in T.D.1. 

There can be little doubt that in practice a slip band can be made 
very quickly, in a fraction of a second. Can a single Frank–Read source 
make a slip band as fast as this? The critical thing is the radius of 
curvature of the central loop of the rotating dislocation spiral, which is 
about μb/σ. This central loop makes about v/{2π(μb/σ)} revolutions per 
second, where v is the linear velocity of the dislocation line in the slip 
plane. The time needed to make a typical slip band, e.g. 1000 revolutions, 
is of order 10³(2πμb/vσ) seconds. For v we shall assume the value 
10⁴ cm sec⁻¹, which is about 0.1 that of sound (see next section). 
Taking also σ/μ = 10⁻⁴ and b = 3 × 10⁻⁸ cm, the time to make the 
slip band is less than 0.0002 sec. 
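
The arithmetic of this estimate can be reproduced in a few lines. The sketch below assumes the figures are read as v = 10⁴ cm sec⁻¹, σ/μ = 10⁻⁴ and b = 3 × 10⁻⁸ cm (the exponents are partly illegible in the print, so these are assumptions chosen to agree with the quoted result); the function name is hypothetical.

```python
import math


def slip_band_time(revolutions, b_cm, v_cm_per_sec, stress_over_modulus):
    """Time for the central loop of a spiral source to make `revolutions` turns.

    The loop radius is mu*b/sigma = b / (sigma/mu), and one revolution
    takes 2*pi*radius / v seconds.
    """
    radius_cm = b_cm / stress_over_modulus
    return revolutions * 2.0 * math.pi * radius_cm / v_cm_per_sec


if __name__ == "__main__":
    # Assumed readings of the values in the text.
    t = slip_band_time(revolutions=1000, b_cm=3e-8,
                       v_cm_per_sec=1e4, stress_over_modulus=1e-4)
    print(f"time for 1000 revolutions: {t:.2e} sec")
```

With these inputs the time is a little under 2 × 10⁻⁴ sec, in line with the bound stated above.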


THE DYNAMIC PROPERTIES OF DISLOCATIONS 


The above theory of the static creation of dislocations raises the question 
of the present status of the earlier dynamic theory (T.D.1, page 100), 
and this leads to the more general question of the dynamics of moving 
dislocations. 

The original conclusion of FRENKEL and KONTOROVA (T.D.1, page 99) 
that the energy of a dislocation rises to infinity with its speed according 
to the “relativistic” formula 


$$U = U_0 / (1 - v^2/c^2)^{1/2}, \tag{2}$$


where U₀ is the rest energy of the dislocation and c is the speed of 
sound in the medium, has now been confirmed by the analysis of 
dislocations in three-dimensional bodies."*. *.1* A simple case is that 
of a screw dislocation lying along the z axis and moving along the x 
axis at a steady speed v. When v = 0 the elastic equation of equilibrium 
governing the strain field of the dislocation is 


$$\frac{\partial^2 w}{\partial x^2} + \frac{\partial^2 w}{\partial y^2} = 0, \tag{3}$$


where w is the component of displacement along the z axis, the other 
two components being zero. For a simple screw dislocation of strength 
b this has the solution 


$$w = b(\theta/2\pi) = (b/2\pi) \tan^{-1}(y/x), \tag{4}$$


where θ is the angle measured about the screw axis from some arbitrary 
reference line. 

When this dislocation is moving the equation of equilibrium is found 
by equating the force acting on any volume element due to the stress 
field to the force of inertia due to the acceleration of the matter in this 
volume element. Equation (3) then becomes 


$$\frac{\partial^2 w}{\partial x^2} + \frac{\partial^2 w}{\partial y^2} = \frac{1}{c^2} \frac{\partial^2 w}{\partial t^2}, \tag{5}$$

where t is the time and 

$$c = (\mu/\rho)^{1/2} \tag{6}$$


is the speed of transverse sound or shear waves in the medium, ρ being 
its density. We need a solution that reduces to equation (4) when 
v = 0, and moves along the x axis with a constant form and constant 
speed v. Equation (5) is a wave equation and it is known from general 
experience with these in electrodynamics that steady propagational 
solutions are formed by making a Lorentz transformation. In the present 
case this means substituting the variable 


$$x_1 = (x - vt)/(1 - v^2/c^2)^{1/2} \tag{7}$$


in the equation. Noting that for a steady propagational solution the 
relation ∂²/∂t² = v²(∂²/∂x²) holds, the equation then becomes 


$$\frac{\partial^2 w}{\partial x_1^2} + \frac{\partial^2 w}{\partial y^2} = 0, \tag{8}$$


which is identical in form with equation (3). Evidently the required 
solution is simply 

$$w = (b/2\pi) \tan^{-1}(y/x_1). \tag{9}$$
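
That the energy of this moving field follows the relativistic formula, equation (2), can be checked numerically. The following Python sketch is an illustration under stated assumptions, not part of the original analysis: it sums the elastic and kinetic energy densities of the field w = (b/2π) tan⁻¹(y/x₁) over an annulus whose inner and outer radii r0 and R are cut-offs defined in the contracted coordinates (x₁, y). The cut-offs, the grid sizes and the function name are all hypothetical choices.

```python
import math


def screw_energy(beta, mu=1.0, b=1.0, r0=1.0, R=1000.0, n_r=200, n_phi=200):
    """Energy per unit length of a screw dislocation moving at v = beta * c.

    Midpoint-rule integration in polar coordinates (r, phi) of the
    (x1, y) plane, using u = ln r as the radial variable.
    """
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    amp = b / (2.0 * math.pi)
    u_lo = math.log(r0)
    du = (math.log(R) - u_lo) / n_r
    dphi = 2.0 * math.pi / n_phi
    total = 0.0
    for i in range(n_r):
        r = math.exp(u_lo + (i + 0.5) * du)
        for j in range(n_phi):
            phi = (j + 0.5) * dphi
            x1, y = r * math.cos(phi), r * math.sin(phi)
            r2 = x1 * x1 + y * y
            dw_dx = -amp * gamma * y / r2   # chain rule: dx1/dx = gamma
            dw_dy = amp * x1 / r2
            elastic = 0.5 * mu * (dw_dx ** 2 + dw_dy ** 2)
            # dw/dt = -v dw/dx and rho * v**2 = mu * beta**2
            kinetic = 0.5 * mu * beta ** 2 * dw_dx ** 2
            # area element: dx dy = dx1 dy / gamma = r**2 du dphi / gamma
            total += (elastic + kinetic) * r * r * du * dphi / gamma
    return total


if __name__ == "__main__":
    u_rest = screw_energy(0.0)
    for beta in (0.3, 0.6, 0.9):
        ratio = screw_energy(beta) / u_rest
        print(f"v/c = {beta}: U/U0 = {ratio:.6f}, "
              f"1/sqrt(1 - v^2/c^2) = {1.0 / math.sqrt(1.0 - beta ** 2):.6f}")
```

The two printed columns should agree closely, which is the content of equation (2) for this particular choice of cut-offs.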


The field of the moving dislocation is the same as that of the stationary 
one except that x₁ replaces x. The factor (x − vt) in x₁ accounts for the 
steady propagation of the dislocation while the Lorentz contraction 
factor √(1 − v²/c²) shows that the range of the stress field along the 
axis of motion is reduced. The energy of the moving dislocation can be 
found from this result. We add the density of elastic energy 
½μ[(∂w/∂x)² + (∂w/∂y)²] to the density of kinetic energy ½ρ(∂w/∂t)² and 
a short calculation for ∂w/∂x, ∂w/∂y, and ∂w/∂t then leads to equation (2). 
This equation shows that a dislocation cannot reach the speed of 
sound because its energy would become infinite. An important question 
is whether it can reach a speed about 80 per cent that of sound, for then 
its behaviour would become significantly different from that of a slow 
dislocation ; for example, it would have enough excess energy to create 
new dislocations. The practical speed limit is reached when the energy 
gained by the moving dislocation from the applied stress is balanced by 
that lost from the dislocation by various dissipative processes. Some 
of these processes have been considered by SEITZ,!’ ESHELBY,’® 
LEIBFRIED,!’ and NABARRO."* The main ones are as follows: 

Thermoelastic Damping 


In an elastic solid a change in hydrostatic pressure causes a change 
in temperature. When an edge dislocation moves the hydrostatic stress 
field caused by the dislocation also moves and so the temperature at any 
fixed point is continually changed. Heat flows down the temperature 
gradients set up in this moving field. This is a thermodynamically 
irreversible process and, therefore, dissipates the mechanical energy of 
the system. ESHELBY’* has shown that thermoelastic damping is 
unlikely to prevent the dislocation from reaching “relativistic” speeds. 


Radiation Damping 


This occurs when elastic waves are transmitted outwards from the 
moving dislocation. Because of the atomic structure of the crystal and 
the breakdown of Hooke’s law at large strains (T.D.1, page 84), the 
precise arrangement at the centre of a dislocation alters periodically 
as it moves from one lattice row to the next. The frequency with 
which it alters is v/b, where v is the speed and b the lattice spacing in the 
direction of motion. Elastic pulses are transmitted out to the surrounding 
crystal with this frequency and dissipate some of the energy 
of the dislocation. The mathematical analysis of these pulsations is 
difficult, and their effect is correspondingly uncertain, but NABARRO’* 
believes that the dissipation is large enough to prevent relativistic 
speeds being reached unless the applied stress is at least one order of 
magnitude greater than the yield stress of soft crystals. SEITZ” has 
reached the same conclusion by a different line of argument. 


Scattering of Sound Waves 


When a sound wave runs into a dislocation both may be disturbed.” 
The inhomogeneous strain field at the centre of the dislocation refracts 
the sound wave through an angle. In turn the sound wave imposes an 
oscillating stress field which may make the dislocation oscillate in its 
slip plane and radiate out elastic waves. The effect of these is that the 
impinging sound wave exerts a force on the dislocation. When the 
dislocation is moving it runs into more sound waves on the leading 
side than on the trailing one and so is subjected to more forces pushing 
it back than forward. By assuming that a moving dislocation scatters 
sound waves in the same way that a line of atoms would, LEIBFRIED"’ 
concludes that the damping due to the oscillation of the dislocation is 
so severe that fast speeds are quite impossible at the yield stresses of 
soft crystals. However, NABARRO’™ has pointed out that Leibfried’s 
argument does not allow for the fact that the scattering power is 
proportional to the wavelength of the impinging waves. Those waves 
approaching the dislocation from behind have an increased effective 
wavelength, due to the Doppler effect, and are, therefore, scattered 
more completely than those approaching from the front. In fact, this 
difference in scattering power exactly cancels the difference in the 
number of sound waves met by the dislocation on the two sides, so that 
Leibfried’s effect does not exist, at least to the first order in v/c. 

We see that, although no firm conclusion has been reached, opinion 
is generally against the idea that dislocations can reach relativistic 
speeds, at low stresses at least. This opinion is supported by an experi- 
ment of LEIBFRIED,” who studied the extension of loaded aluminium 
crystals with an extremely sensitive extensometer and could not detect 
any of the sharp sudden increments of strain that would be expected if 
slip bands were forming from dislocations moving at relativistic speeds. 

Nevertheless, various suggestions have been made recently for 
dislocation processes involving relativistic speeds. OROWAN™ has 
suggested that the unidirectional contraction in the stress field of a fast 
dislocation will cause it to turn out of its original slip plane, and he has 
used this to explain a spread of slip from one plane to another. HOLDEN** 
has suggested that, at the upper yield point in iron, a dislocation that 
has broken free from its solute atoms (T.D.1, page 97) reaches high 
speeds and uses its kinetic energy to free nearby anchored dislocations. 

MOTT and FISHER, HART and PRY™ have used the idea of dynamic 
creation of dislocations at Frank–Read sources to explain the characteristic 
size of slip bands. They both consider double-ended sources and 
suggest that the stress μb/l is necessary only to start the source working. 
The first loop of a sequence formed at the source must grow from a 
standing start and needs this stress, but all loops formed immediately 
afterwards in the sequence have a running start and do not need it. The 
dislocation line bridging the source has kinetic energy as it moves 
through its position of minimum length and, if dissipation is small, this 
energy is enough to carry the dislocation forward to the critical semi- 
circular shape beyond which it can expand into the slip plane once more. 
If this is true it means that once a source starts working it continues to 
do so until the stress round it has been reduced temporarily by an 
amount μb/l. This fall in stress is considered to occur as a result of the 
“back-stress” set up from the dislocations themselves created by the 
source, either when they become piled up against some obstacle (such 
as a deformation band) in the crystal,** or while they are still advancing 
out into the slip plane. In both cases several hundred dislocations 
have to be created before the stress at the source falls sufficiently to 
stop it working, and this is held to account for the slip “avalanches” 
that the observations of HEIDENREICH and SHOCKLEY,” and Brown” 
seem to require.* 

* See, however, the recent work of H. WILSDORF and D. KUHLMANN-WILSDORF, 
Z. angew. Phys., 1952, in press. 
THE CHARACTERISTIC DISLOCATIONS OF CRYSTALS 


The most important feature of a dislocation is its Burgers vector. This 
is the vector defining the movement of the material on one side of a slip 
plane, relative to that on the other side, when the dislocation glides 
through this plane. A more sophisticated definition has been given by 
Frank.*” We shall now consider what conditions have to be satisfied 
by the Burgers vectors of dislocations in various crystal structures. 
Dislocations with such vectors will be called the characteristic dislocations 
of the crystals. 

The essential condition is that the slip displacement which the 
Burgers vector represents must be such as to translate the atoms from 
initial positions to final positions where they are mechanically stable, 
since mechanically unstable configurations cannot exist. There are two 
kinds of such allowed translations, those that preserve the lattice 
(identity translations) and those that generate a new configuration, such 
as a twinned lattice. Dislocations are said to be perfect or imperfect, 
respectively, according as their Burgers vectors give identity translations 
or new configurations. Imperfect dislocations lie along the 
boundaries of stacking faults in the lattice. 

Fig. 8. Unit vectors in the simple cubic lattice 

Dislocations with large Burgers vectors, even though these give 
allowed translations, may be unstable for another reason. The elastic 
energy of a dislocation is proportional to the square of its Burgers 
vector, so that the energy is reduced when large dislocations split up 
into smaller ones. For example, if the lattice constant is b a perfect 
dislocation with strength 2b lowers its energy from 4b² to 2b² if it splits 
into two widely separated dislocations each of strength b. FRANK™ 
has generalized this into a simple rule. In any combination or dissocia- 
tion of dislocations the elastic energy is lowered when the sum of the 
squares of the Burgers vectors is smaller for the product dislocations 
than for the reacting ones. 

The smallest perfect dislocations are called unit dislocations. In a 
simple cubic lattice of spacing a the unit vectors are a[100], a[110], and 
a[111], as shown in Fig. 8. In this notation a vector a/n[hkl] is one with 
components ah/n, ak/n, and al/n, along the principal crystal axes. 
Dislocations with vectors a[100] have the lowest energies, but the 
dissociations 


$$a[111] \to a[110] + a[001] \to a[100] + a[010] + a[001] \tag{10}$$


lead to no reduction in elastic energy, according to Frank’s rule. 
However, they raise the configurational entropy, since there are more 
ways of distributing many dislocations of small strengths than few 
dislocations of large strengths through the crystal. 
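
Frank's rule lends itself to a direct check. The sketch below is a minimal Python illustration, assuming Burgers vectors are given as integer components in units of the lattice spacing a; the function names are hypothetical. It compares the sum of squared Burgers vectors before and after a reaction, and confirms that the simple cubic dissociations above leave that sum unchanged, while the splitting of a dislocation of strength 2b into two of strength b lowers it.

```python
def frank_energy(vectors):
    """Sum of the squared lengths of the Burgers vectors (units of a**2)."""
    return sum(sum(c * c for c in v) for v in vectors)


def reaction_is_favoured(reactants, products):
    """True if Frank's rule predicts a lowering of elastic energy.

    Raises ValueError if the total Burgers vector is not conserved.
    """
    total_in = tuple(sum(c) for c in zip(*reactants))
    total_out = tuple(sum(c) for c in zip(*products))
    if total_in != total_out:
        raise ValueError("Burgers vector is not conserved in this reaction")
    return frank_energy(products) < frank_energy(reactants)


if __name__ == "__main__":
    # Strength 2b splitting into two dislocations of strength b.
    print(reaction_is_favoured([(2, 0, 0)], [(1, 0, 0), (1, 0, 0)]))
    # Simple cubic dissociations of a[111].
    stages = [
        [(1, 1, 1)],
        [(1, 1, 0), (0, 0, 1)],
        [(1, 0, 0), (0, 1, 0), (0, 0, 1)],
    ]
    print([frank_energy(s) for s in stages])
```

The rule compares elastic energy only; the entropy argument in the text is a separate consideration that this sketch does not model.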

A unit dislocation has lowest energy when its Burgers vector is 
parallel to a direction of closest packing in the lattice, which explains 
why this is the preferred direction of slip. The choice of slip plane is 
less easily made, both in theory and practice. According to NABARRO’” 
this depends on two factors, both of which favour planes of wide 
spacings: (a) in a close-packed lattice a unit dislocation will dissociate 
into partial dislocations (see below) when it lies in a close-packed plane; 
this leads to a lowering of strain energy and so dislocations should collect 
preferentially in these planes; (b) the Peierls–Nabarro lattice force 
(T.D.1, page 90) resisting the movement of a dislocation is smallest 
when the Burgers vector is as small as possible and the spacing of the 
slip planes holding the dislocation is as big as possible. CHALMERS and 
MARTIUS” define a parameter f, which is the strength of the dislocation 
divided by the spacing of the slip planes, and point out that most 
observed slip systems can be predicted from the rule that the preferred 
one is that with the smallest f. 

Fig. 9. Dissociation of a unit dislocation (a) into two partial dislocations (b) 

If the lattice is suitable a unit dislocation may dissociate into 
partial dislocations, which are imperfect dislocations with Burgers 
vectors less than one atom spacing in length. This can happen when the 
crystal has more than one mechanically stable configuration. Fig. 9 
illustrates this diagrammatically. Here a unit edge dislocation dis- 
sociates into two partial dislocations, A and B, which are joined by a 
strip of faulted material where the configuration is altered. The vector 
sum of the Burgers vectors of the partial dislocations is equal to the 
Burgers vector of the original dislocation. The question of how far the 
partial dislocations separate depends on the energy of the stacking 
fault between them. This energy is proportional to the area of slip 
plane covered by the fault and hence to the distance between the 
dislocations at its boundary. Thus if ε is the energy per unit area of the 
fault, the force per unit length pulling the dislocations together is also ε. 
The repulsive force due to the elastic interaction of the dislocations is, 
by a generalization of equation (9) in T.D.1, μ(b₁ · b₂)/2πkr, where 
(b₁ · b₂) is the scalar product of the Burgers vectors and r is the distance 
between the dislocations: k = 1 for screw dislocations, and (1 − ν) 
for edges, where ν is Poisson’s ratio. The dislocations are in stable 
equilibrium at the distance 


$$r = \mu (\mathbf{b}_1 \cdot \mathbf{b}_2) / 2\pi k \varepsilon, \tag{11}$$


where these two forces balance. In general this is only a few atom 
spacings. The two partial dislocations thus do not separate very far 
and to a distant observer appear like a single dislocation; they are 
referred to jointly as an extended dislocation. In crystals in which unit 
dislocations are able to extend themselves the plane of slip is more 
clearly defined than in others, because an extended dislocation has a 
unique plane, the plane of its stacking fault, in which it can move easily. 

We shall now consider the characteristic dislocations of some particu- 
lar crystal structures. 


Face Centred Cubic Crystals 


The shortest lattice vector is $a/2[110]$, which joins an atom at a cube
corner to a neighbour at a face centre, and defines the observed slip
direction. Unit dislocations in {111} planes can lower their energies by
splitting into two partial dislocations, the half-dislocations of
HEIDENREICH and SHOCKLEY. Consider slip on a (111) plane, as in Fig. 10.
The vector $\mathbf{b}_1 = a/2[10\bar{1}]$ defines an observed slip
direction. THOMPSON and MILLINGTON and MATHEWSON pointed out that, if
atoms are regarded as balls rolling over one another, then one in a
position of type B would roll most easily to a nearby position of type C.
On an atomic scale of magnitude the atoms are expected to follow a
zig-zag path B → C → B etc., slipping alternately along
$[2\bar{1}\bar{1}]$ and $[11\bar{2}]$, with a mean slip direction
$[10\bar{1}]$.

A slip from B to C changes the order of stacking of the (111) layers. 


A perfect sequence ... ABCABCABC ... becomes the faulted 
sequence .. . ABCACABCA .. . which contains a stacking fault 
CACA.. . consisting of a thin layer of hexagonal close packing. 


Such a fault preserves the typical close packing of nearest neighbours 
and so should have a low energy of formation. This is confirmed by
the observations that these faults are formed freely in cobalt and
lithium during phase transformations, and in certain copper-silicon
alloys during cold working. Moreover, the surface energy of {111}
boundaries of annealing twins in copper, where the stacking sequence is
very similar to that of a fault, is only 21 erg cm$^{-2}$ according to
FULLMAN.

Fig. 10. Slip directions in a (111) plane

HEIDENREICH and SHOCKLEY proposed the dissociation reaction

$$ \frac{a}{2}[10\bar{1}] \rightarrow \frac{a}{6}[2\bar{1}\bar{1}] + \frac{a}{6}[11\bar{2}] \tag{12} $$

for a unit dislocation in (111). This reaction is formally possible since
the sum of the components of the Burgers vectors is the same for both
reactant and product dislocations. Thus the right-hand side sums to

$$ \frac{a}{6}[2+1,\ -1+1,\ -1-2] = \frac{a}{6}[30\bar{3}] = \frac{a}{2}[10\bar{1}]. \tag{13} $$

It is also energetically possible since by Frank's rule the energy is
lowered from $a^2/2$ to $a^2/3$ when it occurs. In Fig. 10 the reaction is
represented by $\mathbf{b}_1 \rightarrow \mathbf{b}_2 + \mathbf{b}_3$.
The original unit dislocation splits into two partials which move apart
in the (111) plane and generate a stacking fault in the region between
them. The equilibrium spacing is given by equation (11) and if we take
$\epsilon = 20$ dyne cm$^{-1}$, $(\mathbf{b}_2 \cdot \mathbf{b}_3) = a^2/12$,
$\mu = 4 \times 10^{11}$ dyne cm$^{-2}$, and $a^2 = 10^{-15}$ cm$^2$,
this spacing is 10 atoms.
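The arithmetic behind this estimate can be repeated in a few lines. The Python sketch below is an illustration added under stated assumptions, not part of the original argument: it assumes the cgs values $\epsilon = 20$ erg cm$^{-2}$, $\mu = 4 \times 10^{11}$ dyne cm$^{-2}$ and $a^2 = 10^{-15}$ cm$^2$, takes $k = 1$ (screw orientation), and assumes the two half-dislocations are $a/6[2\bar{1}\bar{1}]$ and $a/6[11\bar{2}]$. The function and variable names are hypothetical.

```python
import math

def equilibrium_separation(mu, b_dot_b, eps, k=1.0):
    """Equation (11): r = mu (b2 . b3) / (2 pi k eps), all in cgs units."""
    return mu * b_dot_b / (2.0 * math.pi * k * eps)

# Assumed input values (cgs); readings of the worked example, not new data.
MU = 4.0e11          # shear modulus, dyne cm^-2
A_SQUARED = 1.0e-15  # square of the lattice constant, cm^2
EPS = 20.0           # stacking fault energy, erg cm^-2

# Half-dislocations a/6[2,-1,-1] and a/6[1,1,-2] (assumed indices), in units of a.
b2 = (2 / 6, -1 / 6, -1 / 6)
b3 = (1 / 6, 1 / 6, -2 / 6)
b_dot_b = A_SQUARED * sum(p * q for p, q in zip(b2, b3))  # equals a^2 / 12

r = equilibrium_separation(MU, b_dot_b, EPS)
atom_spacing = math.sqrt(A_SQUARED / 2.0)  # nearest-neighbour distance a / sqrt(2)
spacing_in_atoms = r / atom_spacing

print(f"r = {r:.2e} cm, i.e. about {spacing_in_atoms:.0f} atom spacings")
```

With these inputs the separation comes out at the order of ten atom spacings; for edge orientation the factor $k = 1 - \nu$ would make it somewhat larger.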

A shear stress applied along $[10\bar{1}]$ drives both dislocations in the
same direction, since they both contribute to a $[10\bar{1}]$ slip, but a
stress applied along $[1\bar{2}1]$ could either separate them or close
them together.
The stress to separate them completely is large, about 10~*, as is also 
the stress to force them together. Except in the presence of such 
stresses they always preserve their equilibrium spacing and move as a 
unit in the (111) plane. They cannot glide out of this plane, because
their stacking fault prevents this. If they should be forced together,
to make a unit $a/2[10\bar{1}]$ dislocation, and this happens to be a pure
screw, it may then dissociate into two other partials, $a/6[21\bar{1}]$ and
$a/6[1\bar{1}\bar{2}]$, lying in the $(1\bar{1}1)$ plane. This reaction,
which would cause cross-slip, might occur at places of high stress
concentration or at points of emergence for the $a/2[10\bar{1}]$
dislocation, but apart from this deviations from slip on single (111)
planes will be rare.

FRANK has suggested another type of partial dislocation, the
sessile dislocation, in the face centred cubic lattice. Suppose in Fig. 11
that we are looking sideways at the edges of (111) planes. There is a
fault in the centre where part of the central A layer is missing.
The neighbouring B and C layers collapse towards each other in this
region and fit together in correct close-packed order in the faulted
sequence ABCBCA.

Fig. 11. A sessile dislocation

At the rim of this region an edge dislocation is formed with a Burgers
vector $a/3[111]$. Its direction of glide is [111] but it cannot move this
way because this would involve joining pieces of B layer to A layer etc.,
which would leave badly disarranged atoms on the surface delineated by
its track. The dislocation is therefore unable to glide, as its name
implies. Sessile dislocations can be made either by the aggregation of
vacant sites into sheets on {111} planes, forming disc-shaped holes in
these planes whose sides then collapse, as in Fig. 11, or by the
dissociation of unit dislocations; for example, the reaction

$$ \frac{a}{2}[110] \rightarrow \frac{a}{3}[111] + \frac{a}{6}[11\bar{2}] \tag{14} $$

produces a sessile dislocation and a half-dislocation, both lying in (111).
Since face centred cubic crystals can glide on intersecting {111}
planes reactions may occur between dislocations moving on these
planes. One of the most interesting is that studied by LOMER. He
considered two unit dislocations, both parallel to $[1\bar{1}0]$. The first
moves in (111) and has a Burgers vector $a/2[10\bar{1}]$, while the second
moves in $(11\bar{1})$ and has a vector $a/2[011]$. By Frank's rule they
attract each other and so run down their respective slip planes until
they meet and coalesce at the $[1\bar{1}0]$ line where these planes cross.
The reaction is

$$ \frac{a}{2}[10\bar{1}] + \frac{a}{2}[011] \rightarrow \frac{a}{2}[110]. \tag{15} $$

This new dislocation is an edge and cannot glide easily because its slip
plane, i.e. the plane containing its line and its Burgers vector, is (001)
and not a common plane for slip.

COTTRELL has pointed out that this dislocation may become
completely immobile by dissociating,

$$ \frac{a}{2}[110] \rightarrow \frac{a}{6}[11\bar{2}] + \frac{a}{6}[112] + \frac{a}{6}[110]. \tag{16} $$

The first two of these partial dislocations separate along their slip
planes, (111) and $(11\bar{1})$ respectively, and produce stacking faults in
these planes connecting them with the immobile $a/6[110]$ dislocation in
the site of the reaction, the line where these faults join. The mobile
dislocations separate by a few atoms until a position is reached where
the elastic repulsion on them is just balanced by the surface tension
of their stacking faults. This triangular group of parallel dislocation
lines obviously cannot glide.
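The two tests applied to each reaction above, conservation of the Burgers vector and Frank's rule on the sum of $b^2$, are simple enough to check mechanically. The following Python sketch is an added illustration, not the author's procedure. It assumes the Miller indices of reactions (12), (14), (15) and (16) with overbars placed as the vector sums require, writes them as negative components, and works in units of the lattice constant $a$. All function names are hypothetical.

```python
from fractions import Fraction

def burgers(num, den, hkl):
    """Return (num/den)[hkl] as exact fractions, in units of the lattice constant a."""
    return tuple(Fraction(num, den) * c for c in hkl)

def total(vectors):
    """Component-wise sum of a list of Burgers vectors."""
    return tuple(sum(components) for components in zip(*vectors))

def energy(vectors):
    """Frank's rule measure: sum of squared magnitudes, in units of a^2."""
    return sum(sum(c * c for c in v) for v in vectors)

def check(reactants, products):
    """Return (vector sum conserved?, energy before, energy after)."""
    return total(reactants) == total(products), energy(reactants), energy(products)

# Reactions (12), (14), (15), (16); negative entries are the assumed overbars.
reactions = {
    12: ([burgers(1, 2, (1, 0, -1))],
         [burgers(1, 6, (2, -1, -1)), burgers(1, 6, (1, 1, -2))]),
    14: ([burgers(1, 2, (1, 1, 0))],
         [burgers(1, 3, (1, 1, 1)), burgers(1, 6, (1, 1, -2))]),
    15: ([burgers(1, 2, (1, 0, -1)), burgers(1, 2, (0, 1, 1))],
         [burgers(1, 2, (1, 1, 0))]),
    16: ([burgers(1, 2, (1, 1, 0))],
         [burgers(1, 6, (1, 1, -2)), burgers(1, 6, (1, 1, 2)),
          burgers(1, 6, (1, 1, 0))]),
}

results = {n: check(lhs, rhs) for n, (lhs, rhs) in reactions.items()}
for n, (ok, before, after) in results.items():
    print(f"({n}) balanced={ok}  sum b^2 before={before}  after={after}")
```

On this crude $b^2$ measure reactions (12), (15) and (16) lower the energy, while reaction (14) leaves it unchanged, so that the formation of a sessile dislocation by dissociation is not favoured by Frank's rule alone.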


Close Packed Hexagonal Crystals 


The unit vector in the basal plane is of length a, the lattice constant in 
this plane, and follows a $\langle 2\bar{1}\bar{1}0 \rangle$ direction of closest packing, which is
also the observed slip direction. Dislocations with such vectors can 
follow any path through the crystal but prefer to lie in basal planes 
where they can dissociate into half-dislocations, as in the face centred 
cubic lattice. Basal slip is common in this structure. The shortest 
lattice vector out of the basal plane is [0001] and of length c, the lattice 
constant along the hexagonal axis. Dislocations with this vector are not 
easily moved but are important in crystal growth. The growth spiral 
shown in Fig. 5 starts from such a dislocation. Dislocations with Burgers 
vectors along the hexagonal axis may also be important as pole
dislocations in deformation twinning (see below).


Body Centred Cubic Crystals 


The shortest lattice vector is a/2[111] which joins an atom at a cube 
corner to the one in the cube centre, and defines the observed slip 
direction. 

This lattice can be twinned by a shear of $1/\sqrt{2}$ or $\sqrt{2}$ in a
$\langle 111 \rangle$ direction on a {112} plane, and twins of this kind
can be produced by deformation in iron. This raises the possibility of a
unit dislocation in a (112) plane dissociating as follows

$$ \frac{a}{2}[11\bar{1}] \rightarrow \frac{a}{6}[11\bar{1}] + \frac{a}{3}[11\bar{1}], \tag{17} $$

to make two partial dislocations separated by a stacking fault consisting
of a monolayer of twinned crystal. This dissociation is less certain than
those discussed above because the original packing of nearest
neighbours is not preserved on the twin interface. These partial
dislocations have parallel Burgers vectors and cannot be separated by a
homogeneous stress. Also, they are both of pure screw type when they lie
along $[11\bar{1}]$, and since $[11\bar{1}]$ is the line along which (112),
$(\bar{1}21)$ and $(2\bar{1}1)$ intersect, they can glide out of their
original slip plane when in this orientation. Thus the limitation to a
single slip plane is less severe than in the other crystal structures
considered.

A sessile dislocation may perhaps be formed by a dissociation of the
type

$$ \frac{a}{2}[111] \rightarrow \frac{a}{3}[112] + \frac{a}{6}[11\bar{1}] \tag{18} $$

when the reacting dislocation lies in (112). The dislocation $a/3[112]$ is
pure edge and sessile. It forms the boundary of a stacking fault made
by inserting two fault planes in the (112) stacking sequence. The
sequence is ... ABCDEFA ..., and the fault is of the kind
... ABCDCDEFA ...

THE GROWTH OF DEFORMATION TWINS


The idea that deformation twinning takes place by the continuous 
growth on an atomic scale of twinned material has often been suggested, 
and the general arguments for a dislocation mechanism of twinning are 
the same as for slip: it is unbelievable that the atoms all move simul- 
taneously, and the stress for twinning is far below the theoretical shear 
strength of a perfect lattice. Examples of deformation twinning are 
known which consist either of simple shears on the twinning plane 
(body centred cubic iron) or of shears accompanied by localized 
re-arrangements (‘‘shuffles’’) of the atoms (hexagonal metal crystals). 
The shears can be produced directly by partial dislocations, but not the 
shuffles; it is presumed that these follow the shears automatically, 
since the structure will generally be mechanically unstable until the 
twinning process is completed. 

The main problem is to understand how twinning develops homo- 
geneously through successive planes. This seems to require either a 
twinning dislocation on every plane, or the motion of a single dislocation 
successively from plane to plane. COTTRELL and BILBY have recently
suggested a mechanism, based on that of FRANK and READ for slip,
whereby successive twinning planes are swept consecutively by a single
twinning dislocation. Fig. 12 illustrates it. Here OA, OB, and OC are
dislocation lines and CDE is the twinning plane. The twinning
dislocation OC has its Burgers vector in this plane, and this vector is
such that the dislocation produces the correct shear for twinning
between the atoms bounding this plane as it sweeps through the plane,
rotating about its point of emergence O. The dislocations OA and OB are
immobile and each has a component in its Burgers vector along the
normal to the twinning plane; this component of the vector makes the
dislocations OA and OB partly or wholly of screw character, with a
pitch equal to the spacing of the twinning layers. Because of this, these
twinning layers are not a number of separate planes stacked on one
another, but the leaves of a single continuous spiral surface (a helicoid
or Riemann surface), winding round the pole dislocation AOB like a
spiral staircase. It follows that, as the twinning dislocation rotates
round the pole dislocation, it not only produces a monolayer sheet of
twinned crystal but also climbs up the "staircase" to the next layer,
where it repeats the process. In this way a thick layer of twin can be
built up. The time needed to make a twin by this process is similar to
that to make a slip band by the Frank-Read mechanism, and it is known
experimentally that deformation twins can form very quickly, in a few
microseconds.

Fig. 12. The pole mechanism of deformation twinning

COTTRELL and BILBY have applied this theory to the body centred cubic
lattice. Suppose that the line AOB in Fig. 12 was originally a unit
dislocation with Burgers vector $a/2[111]$ and that the part OB lies
in a (112) plane. Under a suitable stress the reaction 18 can occur at
OB, leaving OB itself as a sessile dislocation $a/3[112]$ and producing
the twinning dislocation OC, $a/6[11\bar{1}]$. If OC turns into a twinning
plane CDE which is $(\bar{1}21)$ or $(2\bar{1}1)$ it can rotate in this
about the pole dislocation and make a twin.

A similar analysis can be made for the face centred cubic lattice,
starting from the reaction 14, but in this case only a twin monolayer
can be formed, not a thick twin, because the OB part of the pole
dislocation, $a/3[111]$, has to lie in the (111) twinning plane and the
$a/6[11\bar{2}]$ twinning dislocation runs into it at the end of one
revolution in this plane. This process may be important in the production
of single stacking faults in this lattice, a problem which has also been
considered from the dislocation standpoint by CHRISTIAN. Pole mechanisms
for the growth of deformation twins in hexagonal metals have been
suggested recently, and in these the pole dislocation has a Burgers
vector along the hexagonal axis. It is probable that shear
transformations of the martensitic type occur by the movement of
dislocations, and FRANK has recently discussed the crystallography of the
austenite-martensite transformation from this standpoint. The problem is
complicated by the fact that the deformation is not a single shear.


INTERACTIONS OF DISLOCATIONS 


Three kinds of interactions involving dislocations have now been 
recognized, two of which are elastic in nature while the third is
geometrical. The first kind of elastic interaction is the ordinary long-
range one discussed in T.D.1. We call this the "Hookean" interaction
because it depends on stress fields round dislocations which in turn
depend on Hooke's law being obeyed at small strains. Recent work on
Hookean interactions has been concerned either with confirming existing
results by methods of greater rigour and power, or with developing
new results, mainly about arrays of dislocations in slip planes or in crystal 
boundaries. Non-Hookean interactions originate in deviations of the 
atomic law of force in the slip planes from Hooke’s law, and are impor- 
tant when considering the stress necessary to drive a dislocation 
through a grain boundary, or a sheet of stacking fault, or any similar 
inhomogeneity in the material. Geometrical interactions occur when 
intersecting dislocations in different glide systems cut through one 
another. The "jogs" that are then formed in these dislocations have
some interesting properties; in particular jogs in screw dislocations 
create vacant atomic sites or interstitial atoms when these dislocations 
are moved. We shall discuss some of the more interesting of these 
various effects. 


DISLOCATIONS IN CRYSTAL BOUNDARIES


The idea that grain boundaries between crystals inclined at small
angles to each other are composed of dislocations has already been
discussed in T.D.1. It has attracted interest because the properties of
these small-angle boundaries can be measured experimentally and so
form a testing ground for the theory. SHOCKLEY and READ have been
mainly responsible for developing this line of approach.

There are three properties capable of experimental study : 

(a) The relation between the spacing of dislocations in the boundary 
and the angle between the grains. 

(b) The mobility of the boundary under stress.

(c) The energy of the boundary, and its variation with the angle 
between the grains. 

We shall discuss these from the standpoint of the classical Burgers 
tilt boundary shown in Fig. 13. Here two crystals are joined along the 
plane z = 0 after being rotated by equal and opposite amounts about 
the y axis. The angle of rotation is 

$$ \theta = 2\tan^{-1}(b/2h) \approx b/h, \tag{19} $$

where $h$ is the spacing of edge dislocations of strength $b$ in the
boundary. When this angle is sufficiently small (< 10-* radians) the
spacing between the dislocations is large enough to be resolvable in a
microscope. SHOCKLEY and READ pointed out that the dislocation structure
of such boundaries has in fact already been resolved by LACOMBE in
a study of "veining" in aluminium. LACOMBE found that a network of
faint lines existed in a crystal which consisted of rows of separate etch
pits, and that there was a small difference in orientation between the
regions separated by these "veins." SHOCKLEY and READ suggested
that the etch pits were formed at the points of emergence of the
dislocations in the boundaries. The order of magnitude of the angle
$\theta$ across such a boundary was 10-* radians or less, while the
spacing of the etch pits was not more than about 3 x 10-*cm. Further
work along these
lines should enable a rigorous check of equation (19) to be made. 
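Equation (19) is easily inverted to give the dislocation spacing expected for a measured tilt angle. The short Python sketch below is an illustration added under stated assumptions: the Burgers vector strength $b = 2.86 \times 10^{-8}$ cm is an assumed value for aluminium (a/√2 with a ≈ 4.05 × 10⁻⁸ cm), the angles are chosen arbitrarily, and the function names are hypothetical.

```python
import math

def tilt_angle(b, h):
    """Equation (19): exact tilt angle 2 arctan(b / 2h), in radians."""
    return 2.0 * math.atan(b / (2.0 * h))

def dislocation_spacing(b, theta):
    """Inverse of equation (19): spacing h for a given tilt angle theta (radians)."""
    return b / (2.0 * math.tan(theta / 2.0))

B_ALUMINIUM = 2.86e-8  # cm; assumed Burgers vector strength for aluminium

for theta in (1e-4, 1e-3, 1e-2, 1e-1):
    h = dislocation_spacing(B_ALUMINIUM, theta)
    approx = B_ALUMINIUM / theta  # small-angle form h = b / theta
    print(f"theta = {theta:.0e} rad   h = {h:.3e} cm   b/theta = {approx:.3e} cm")
```

The small-angle form $h = b/\theta$ agrees with the exact expression to better than one part in a thousand even at a tilt of a tenth of a radian, so that it is adequate for any boundary to which the dislocation picture applies.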
The dislocations of the simple boundary of Fig. 13 are geometrically
free to glide as a unit along their slip planes, although this property
does not extend to some of the more complicated boundaries containing
two or more kinds of dislocations. Moreover, the force from an applied
stress on any one of the dislocations in this simple boundary is exactly
the same as it would be (T.D.1, equation (1)) if this dislocation were
alone present in the crystal, and so the boundary should move when a
small shear stress is applied to the crystal. Boundaries of this simple
and mobile kind are frequently formed during the bending and cutting of
hexagonal crystals, where they are called bend planes, and direct visual
observation of their movement under stress has now been obtained.

Fig. 13. The simple Burgers boundary

The reason why the boundary of Fig. 13 is mechanically stable
was given in T.D.1. When the horizontal distance between two parallel
edge dislocations of the same sign is smaller than the (vertical) distance
between their slip planes, they are attracted by their Hookean
interaction along the slip direction and pull into alignment, one above
the other. The strain energy is smaller for the arrangement of
dislocations in Fig. 13 than for other arrangements. The boundary in this
diagram is a surface across which each crystal exerts forces on the
other. These forces are associated with local indentations along the
lines where a step on one face is pressed into the other face. These
steps occur at intervals h up the face, so that the forces alternate in
sign along the y axis with a period h. Elasticity theory shows that
periodic forces of this kind, with zero resultant and moment, produce
stress fields that penetrate inwards from the surface hardly beyond a
distance of order h.

Now the strain energy of a dislocation is logarithmically proportional
to the radius of its stress field (T.D.1, equation (7)). It follows that the
strain energy of a dislocation in the boundary is proportional to log h,
and this is smaller than could be obtained by any other distribution of
the dislocations in the crystal. Dislocations occur at intervals h along
the boundary so that the elastic energy of unit area of the boundary is
proportional to (1/h) log h, i.e. to $-\theta \log \theta$. The energy
contributed by the cores of the dislocations is proportional to the
number of these, i.e. to $\theta$. Hence the surface energy of the
boundary should increase with angle according as the function
$\theta(A - \log \theta)$, where A is a constant.

A general formula for the stress field of the boundary can be found
by adding together the fields of the dislocations in it. This gives the
shear stress acting on the slip plane of one of the dislocations in the
slip direction as

$$ \sigma_{xy} = \frac{\mu b}{2\pi(1-\nu)} \, \frac{\pi^2 x}{h^2 \sinh^2(\pi x/h)}, \tag{20} $$

where $x$ is the distance in the slip direction from the boundary. We can
use this to find the strain energy of the boundary. Suppose that we
start with a single crystal and make the boundary by (a) cutting from
one side to the centre along every slip plane that is to contain a
dislocation, and (b) applying to the two faces of each of these cuts the
force needed to displace one face rigidly past the other by the distance
$b$. In this process all dislocations in the boundary are to be created
simultaneously so that the applied forces are the same on all cuts. The
total work done by these forces is the energy of the boundary. We know
the magnitude of the force at every point on the cut faces from equation
(20), and also at every stage during the creation of the boundary
because, from Hooke's law, the displacement of the cut surfaces increases
linearly with the force on them. Thus the strain energy per dislocation
is $\frac{1}{2}$ × force × displacement, integrated along the slip plane,
i.e.

$$ \frac{1}{2}\int_{x_0}^{x_1} \sigma_{xy}\, b \, dx = -\frac{\mu b^2}{4\pi(1-\nu)} \left[ \log(\pi x_0/h) + \log 2 - 1 \right], $$

where $x_0$ is the radius of the core of the dislocation, $x_1$ is the
length of the cut plane, and where equation (20) has been used for
$\sigma_{xy}$. Introducing $\theta$ for $b/h$, the energy of unit area of
boundary is

$$ E = \frac{\mu b}{4\pi(1-\nu)}\, \theta (A - \log\theta), \quad \text{where } A = 1 + \log(b/2\pi x_0). \tag{21} $$


This result is a special case of a more general one derived by SHOCKLEY
and READ for boundaries containing more than one kind of dislocation.
The contribution of the core energies can be included in the formula
by suitably altering the magnitude of A, and hence $x_0$. Using the
dislocation model of PEIERLS and NABARRO (T.D.1, page 90), VAN DER
MERWE has shown that A is typically about 0.5.
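The integration that leads from equation (20) to the logarithmic energy law can be checked numerically. With $u = \pi x/h$ the energy per dislocation is $\mu b^2/4\pi(1-\nu)$ times the integral of $u/\sinh^2 u$ from the core to a large distance, and for a small core this integral should approach $-[\log(\pi x_0/h) + \log 2 - 1]$, which is $A - \log\theta$. The Python sketch below is an added illustration under stated assumptions: the core radius $x_0 = b/2$ and the angle $\theta = 0.01$ radian are hypothetical values, Simpson's rule is our choice of quadrature, and all names are invented for the example.

```python
import math

def simpson(f, lo, hi, n=4000):
    """Composite Simpson rule with an even number n of intervals."""
    step = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * step)
    return total * step / 3.0

def energy_integral(u0, u1=40.0):
    """Integral of u / sinh(u)^2 from u0 to u1, where u = pi x / h.

    The substitution u = exp(t) removes the 1/u behaviour near the core.
    """
    def integrand(t):
        u = math.exp(t)
        return (u / math.sinh(u)) ** 2
    return simpson(integrand, math.log(u0), math.log(u1))

def closed_form(u0):
    """Small-core limit: -[log(u0) + log 2 - 1]."""
    return -(math.log(u0) + math.log(2.0) - 1.0)

# Hypothetical illustration: core radius x0 = b / 2, tilt angle 0.01 radian.
x0_over_b = 0.5
theta = 0.01
u0 = math.pi * x0_over_b * theta  # pi x0 / h with h = b / theta
A = 1.0 + math.log(1.0 / (2.0 * math.pi * x0_over_b))

numeric = energy_integral(u0)
print(f"numerical integral  = {numeric:.6f}")
print(f"closed form         = {closed_form(u0):.6f}")
print(f"A - log(theta)      = {A - math.log(theta):.6f}")
```

The closed form and $A - \log\theta$ are identical by construction; the numerical integral differs from them only by terms of order $(\pi x_0/h)^2$, which is the approximation implicit in dropping the core size against the spacing.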




The measurement of relative grain boundary energies in solids has
been made possible by the methods of SMITH and SHUTTLEWORTH,
KING and CHALMERS, which determine the relative surface tensions of
boundaries from the angles at which three of them meet along a line.
These have been used to test the $\theta(A - \log\theta)$ law by DUNN and
LIONETTI and AUST and CHALMERS. Fig. 14 shows the results of
AUST and CHALMERS on lead compared with such a curve, using the
value A = 0.2.
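The shape of such a curve is easily tabulated. The function $\theta(A - \log\theta)$ has its maximum where $A - \log\theta - 1 = 0$, i.e. at $\theta = e^{A-1}$, and the energy relative to this maximum depends only on the ratio of $\theta$ to that angle. The Python sketch below is an added illustration; it assumes that $\theta$ is measured in radians (the text does not state the unit) and uses natural logarithms, and the function names are hypothetical.

```python
import math

def relative_energy(theta, A):
    """The function theta (A - log theta); theta assumed to be in radians."""
    return theta * (A - math.log(theta))

def angle_of_maximum(A):
    """Setting the derivative A - log(theta) - 1 to zero gives theta = exp(A - 1)."""
    return math.exp(A - 1.0)

A = 0.2  # value quoted for the curve compared with the lead measurements
theta_max = angle_of_maximum(A)
e_max = relative_energy(theta_max, A)

print(f"maximum at {math.degrees(theta_max):.1f} degrees")
for degrees in (1, 5, 10, 20, 30):
    theta = math.radians(degrees)
    print(f"{degrees:2d} deg   E/E_max = {relative_energy(theta, A) / e_max:.3f}")
```

On these assumptions the curve rises steeply at small angles and flattens towards a maximum at roughly 26 degrees, well outside the range in which the dislocation model itself can be trusted.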

It is surprising that there is agreement not only at low angles, where
it might be expected, but also at large angles where the theory is no
longer reliable. LOMER and NYE have shown with the bubble raft
(T.D.1) that the nature of the boundary changes rapidly, when $\theta$
exceeds a few degrees, from one made of dislocations to one in which
dislocations cannot be recognized. In particular the dislocations, which
are wide (T.D.1, page 91) when $\theta$ is small, contract when $\theta$
approaches the range where the spacing between dislocations is equal to
their width. This contraction changes them into defects resembling in
some ways rows of vacant sites.

Fig. 14. Variation of boundary free energy with difference in
orientation between crystals of lead; (1) theoretical curve with
A = 0.2; O experimental values of AUST and CHALMERS

NABARRO's explanation of the contraction is as follows. It is
possible to regard an ordinary dislocation as a narrow assemblage in
the slip plane of many dislocations of infinitesimal strengths. The
width $\zeta$ of the dislocation is governed by the elastic repulsions of
these infinitesimal dislocations and by the misfit energy in the strip of
slip plane containing it. Now a wall of dislocations such as that in
Fig. 13 can be regarded as a set of parallel and closely spaced walls,
the component dislocations of which are the infinitesimal dislocations
which define $\zeta$. This becomes important when the spacing $h$ is
smaller than $\zeta$, for then $\zeta$ is determined by the elastic
repulsion of walls of infinitesimal dislocations. As equation (20) shows,
the stress field of a wall of dislocations has a shorter range than that
of a single one. The walls of infinitesimal dislocations therefore move
closer together as $\theta$ is increased into the range where
$h = \zeta$, which is the effect that LOMER and NYE observe.

DISLOCATIONS IN SLIP BANDS


The slip displacement at one end of a long slip plane can differ
substantially from that at the other end, as may be seen from the fact
that the arrangement of slip bands on one side of a worked crystal is
often quite different in detail from that on the other side. This
difference in displacement is taken up by a set of dislocations spaced
through the slip plane. Various aspects of such arrays of dislocations
are interesting. On the purely formal side, it is possible to regard the
whole set of dislocations as a single dislocation of a more general kind,
with a Burgers vector that is not constant along the slip plane. Another
approach is that of NYE, who, by a photoelastic study of internal
stresses in deformed silver chloride crystals, has shown very clearly
that some of these stresses are caused by rows of edge dislocations, all
of the same sign, contained in bent slip planes.

Fig. 15. A row of edge dislocations pressed against an obstacle

In T.D.1, page 105, it was shown that a row of n edge dislocations, all 
of the same sign, which are pushed against an obstacle in the slip 
plane exert a strong pressure against it (Fig. 15); in particular the 
leading dislocation in the sequence is driven against it by a stress n 
times the applied stress. It was also pointed out that the dislocations 
are spaced more closely together at the obstacle end of the sequence, 
and that because they are all of the same sign they bend the slip 
plane. The point of interest about this arrangement of dislocations is 
that it represents the spearhead of a slip band held up by an obstruction. 
The pressure exerted by the dislocations amounts in effect to a concen- 
tration of shear stress at the end of the slip band, and this concentration 
may enable the slip band to push through the obstacle into the crystal 
beyond it.

This stress concentration depends upon L, the length of the slip
plane from the source S to the obstacle P, since this is a factor on
which the number of dislocations piled up depends. We can estimate n
roughly as follows. Suppose that the source becomes active when an
arbitrarily small stress acts on it, and that this stress is also sufficient
to make dislocations glide in the plane. If a stress $\sigma$ is applied
to the crystal the source feeds dislocations into the slip plane until
the back stress from these cancels the applied stress. Roughly, then, the
source stops working when the applied stress is relaxed to zero inside an
approximately circular region of radius L centred on the source. The
elastic strain in this region is correspondingly reduced from the value
$\sigma/\mu$ to zero, and this is accomplished by a slip displacement
$2L(\sigma/\mu)$ at the source. To produce this displacement by
dislocations of strength b the number required is

$$ n = 2L\sigma/\mu b. \tag{22} $$

A more accurate theory has been given by ESHELBY, FRANK and
NABARRO, which enables the positions of the dislocations to be found.
These positions, denoted by $x_i$ (i = 1, 2, ... n), are defined as the
values of the distance x along the slip plane at which the polynomial

$$ f(x) = (x - x_1)(x - x_2) \ldots (x - x_n) \tag{23} $$


is zero. The equilibrium of the dislocations can be stated in the form of a 
differential equation for f(x) the solution of which is a polynomial equal 
to zero at n different values of x. In the problem of Fig. 15 it turns out 
that the distribution of the n dislocations along the slip plane is 
identical with that of the radial distribution of nodes in the ns state 
of the hydrogen atom. 

For large values of n the following results are obtained:

(a) The number of dislocations packed into a length L of the slip
plane is

$$ n = \pi L \sigma k/\mu b, \tag{24} $$

where k is unity for a screw dislocation, $(1 - \nu)$ for an edge and
$\nu$ is Poisson's ratio. This compares with our equation (22) above.

(b) The distance between the two leading dislocations is

$$ d = 1.84(\mu b/2\pi k n \sigma). \tag{25} $$

(c) The shear stress on the glide plane at a distance x ahead of the
leading dislocation is practically the same as that due to this dislocation
alone in the region where $x < d$. At distances where $d < x < L$ the
stress concentration factor is approximately

$$ 1 + (L/x)^{1/2}. \tag{26} $$

This stress concentration can be large, and the dislocations then produce
the same stress concentration as does a freely slipping crack of the same
length.

(d) The total slip produced by the sequence is the same as if a single
dislocation of strength nb were moved a distance 3L/4.

FRANK has suggested that this high concentration of stress at the
end of a piled up row of dislocations may be sufficient in a
polycrystalline specimen to force the creation of dislocations on the
other side of a grain boundary which is holding up the row. In the next
section we shall turn to consider the nature of the force preventing the
leading dislocation from entering the grain boundary.
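Results (a), (b) and (c) for the piled-up row, together with the rough estimate of equation (22), can be put into numbers as follows. The Python sketch is an added illustration only: the shear modulus, Burgers vector, applied stress and slip plane length are hypothetical cgs values chosen to give a convenient order of magnitude, screw character (k = 1) is assumed, and the function names are invented for the example.

```python
import math

def n_rough(L, sigma, mu, b):
    """Equation (22): rough estimate n = 2 L sigma / (mu b)."""
    return 2.0 * L * sigma / (mu * b)

def n_pileup(L, sigma, mu, b, k=1.0):
    """Equation (24): n = pi L sigma k / (mu b)."""
    return math.pi * L * sigma * k / (mu * b)

def lead_spacing(n, sigma, mu, b, k=1.0):
    """Equation (25): distance between the two leading dislocations."""
    return 1.84 * mu * b / (2.0 * math.pi * k * n * sigma)

def stress_concentration(L, x):
    """Equation (26): factor 1 + (L / x)^(1/2), meant for d < x < L."""
    return 1.0 + math.sqrt(L / x)

# Hypothetical cgs values, for illustration only.
MU = 4.0e11     # shear modulus, dyne cm^-2
B = 2.5e-8      # Burgers vector strength, cm
SIGMA = 1.0e7   # applied shear stress, dyne cm^-2
L = 1.0e-2      # length of slip plane from source to obstacle, cm

n24 = n_pileup(L, SIGMA, MU, B)
d = lead_spacing(n24, SIGMA, MU, B)
print(f"n from (22) = {n_rough(L, SIGMA, MU, B):.1f}, n from (24) = {n24:.1f}")
print(f"leading spacing d = {d:.2e} cm")
print(f"stress concentration at x = 10 d: {stress_concentration(L, 10 * d):.1f}")
```

Combining (24) and (25) gives $d = 0.92\,L/n^2$, so that the leading dislocations are crowded into a very small fraction of the slip plane once n is at all large.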


Note added in proof: D. KUHLMANN-WILSDORF, J. H. VAN DER MERWE,
and H. WILSDORF (Phil. Mag. 43 (1952) 632) have recently obtained
experimental evidence supporting the theory of dislocations in slip
bands. With the aid of an electron microscope they measured the spacing,
length, and amount of glide, of individual fine slip lines formed in
lightly strained aluminium. The number of dislocations in an elementary
slip line was calculated from these measurements, and it was shown that
the observed spacing of the lines is comparable with the theoretical
critical distance below which rows of dislocations in parallel slip
planes cannot pass each other.


OBSTRUCTION OF DISLOCATIONS AT GRAIN BOUNDARIES 


As a dislocation approaches a grain boundary it may be subjected to 
three forces, one long-range and two short-range ones. The long-range 
force occurs where there is a change of elastic constants beyond the 
boundary, for example, where the boundary is a free surface, or where 
the boundary is between two phases of an alloy, or between two crystals 
in an elastically anisotropic material. The force occurs because, as the 
dislocation approaches the boundary, more of its strain field is imposed 
on the material beyond, and the energy to deform this is different 
from that to deform the material on the near side of the boundary. The 
only case that has been studied in detail is that of a free surface. 
KOEHLER showed that a dislocation is attracted towards this by an
“image” force inversely proportional to the distance from the surface. 
When applying this result in practice it should be remembered that 
many surfaces are covered with films, e.g. oxide coats on metals. The 
presence of an adherent film turns the boundary of the crystal into an 
interface between phases, and this will generally repel dislocations 
approaching it from inside the crystal. There is much experimental 
evidence for the fact that oxide coats make slip more difficult in soft 
metal crystals.’ A striking indication of the pressure of piled-up
dislocations against such oxide coats has been given by BARRETT;”* he
showed that if the oxide coat were removed from a zinc crystal that had 
recently been twisted plastically, the crystal would spontaneously twist 
a little further as the piled-up dislocations were allowed to run out. 
The two short-range forces acting on a dislocation approaching a 
boundary are that due to the stress field of the dislocations in a boundary 
between different crystals, and one due to the fact that the law of 
force between atoms in the slip plane changes at the boundary. This 
latter is an example of a non-Hookean interaction and we shall use it to
illustrate the general theory of such interactions, developed by
JASWON and FOREMAN.”

Suppose in Fig. 16 that a dislocation in the slip plane PQ is approach- 
ing a boundary AB beyond which the law of force between the atoms 
in the slip plane is different. In fact QR need not be a real slip plane 
at all, but merely a geometrical continuation into the region beyond the 
boundary. To make a definite picture, suppose that AB is the boundary 
between two elastically isotropic crystals that differ only in orientation 
from one another. Then for small strains the law of force between the
atoms across the slip plane is the same in PQ and QR, and is Hooke’s
law $\sigma_{xy} = \mu\phi/d$, where $\sigma_{xy}$ is the shear stress, $\phi$ is the shear displacement,
and $d$ is the spacing of the planes. For large strains the force is larger
in QR than in PQ, since QR is not physically suitable for slip.

Fig. 16. A dislocation approaching a boundary

At some distance from it the dislocation produces strains in the 
boundary that are small enough for Hooke’s law to be obeyed. There 
is no Hookean distinction between PQ and QR, so that no force is 
caused by the change in the nature of the slip plane until the disloca- 
tion is close enough for Hooke’s law to be no longer obeyed at the 
boundary. This non-Hookean force is thus a short-range one. 

JASWON and FOREMAN give a theorem for calculating $s$, the distance
to which a dislocation can be pushed towards the boundary by an
applied stress $\sigma$. Later, NABARRO’®” showed that this could be derived
rather simply by applying the principle of virtual work. Consider a
piece of slip plane lying between $x$ and $x + dx$ from the dislocation,
and denote the energy in this piece by $g(\phi)\,dx$ when $x < s$, and by
$h(\phi)\,dx$ when $x > s$. The shear displacement $\phi$ depends on $x$, and the
laws of force in the two regions of the slip plane are $\sigma_{xy} = -dg/d\phi$ and
$\sigma_{xy} = -dh/d\phi$, respectively. Let the entire field of displacement
representing the dislocation be moved forward rigidly by a small
distance $\delta x$. The strain energy in the (Hookean) regions outside the
slip plane is not altered, while the energy in the slip plane is altered by
the same amount as if the boundary AB were moved from $s$ to $s - \delta x$.
This energy change is $\{h(\phi_s) - g(\phi_s)\}\,\delta x$, where $\phi_s$ is the value of $\phi$ at $s$.
The work done by the applied stress in moving the dislocation is
$\sigma b\,\delta x$ per unit length. Hence the condition for equilibrium is

$$\sigma b = h(\phi_s) - g(\phi_s). \tag{28}$$


This is an equation for $s$ and to solve it we have first to choose suitable
functions for $g(\phi)$ and $h(\phi)$. JASWON and FOREMAN use the Peierls
sinusoidal law (T.D.1, page 90) for $g$ and Hooke’s law for $h$. Next we
have to determine $\phi_s$ as a function of $s$; the simplest procedure here is
to take an average of the values for $\phi_s$ obtained by assuming in turn
that $g(\phi)$ or $h(\phi)$ is valid throughout the whole slip plane.

Using the above laws of force JASWON and FOREMAN show that the
stress needed to hold the dislocation at $s$ is given approximately by

$$\sigma = (\mu/6\pi^2)(\beta/s)^4, \tag{29}$$

where $\beta = b/2(1 - \nu)$. The short-range character of this interaction,
varying as $s^{-4}$, is to be noted. As a result, the stress needed to hold a
dislocation at 10 atom spacings from the boundary is only 10-‘u, 
whereas that needed to hold it at 1 atom spacing is nearly as large as
the theoretical shear strength of the lattice, $\mu/30$.
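
The virtual-work condition can be checked numerically. The following is only a sketch under stated assumptions: the Peierls energy is taken as $g = (\mu b^2/4\pi^2 d)\{1 - \cos(2\pi\phi/b)\}$, the Hooke energy as $h = \mu\phi^2/2d$, the plane spacing $d$ is set equal to $b$, Poisson's ratio is taken as 1/3, and $\phi_s$ is taken from the Peierls arctangent profile alone rather than from the averaging procedure described above. The function names are illustrative and do not come from the original papers. With these choices the leading term of $h - g$ falls off as the inverse fourth power of $s$.

```python
import math

def peierls_energy(phi, mu=1.0, b=1.0, d=1.0):
    """Energy per unit area of slip plane for the Peierls sinusoidal law (assumed form)."""
    return mu * b**2 / (4 * math.pi**2 * d) * (1 - math.cos(2 * math.pi * phi / b))

def hooke_energy(phi, mu=1.0, d=1.0):
    """Energy per unit area of slip plane for Hooke's law, sigma = mu*phi/d."""
    return mu * phi**2 / (2 * d)

def holding_stress_exact(s, nu=1/3, mu=1.0, b=1.0):
    """Stress from sigma*b = h(phi_s) - g(phi_s), with phi_s from the Peierls profile."""
    beta = b / (2 * (1 - nu))
    phi_s = (b / math.pi) * math.atan(beta / s)  # assumed displacement at distance s
    return (hooke_energy(phi_s, mu, b) - peierls_energy(phi_s, mu, b, b)) / b

def holding_stress_approx(s, nu=1/3, mu=1.0, b=1.0):
    """Leading-order form (mu / 6 pi^2) * (beta / s)**4, valid for s much larger than beta."""
    beta = b / (2 * (1 - nu))
    return mu / (6 * math.pi**2) * (beta / s)**4

if __name__ == "__main__":
    for s in (1.0, 10.0, 100.0):
        print(s, holding_stress_exact(s), holding_stress_approx(s))
```

The two functions agree closely at large $s$ and diverge near $s = b$, where the small-displacement expansion is no longer justified.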









MECHANICAL HYSTERESIS IN POLYCRYSTALS





One of the effects of the obstruction of dislocations at grain boundaries 
is that a polycrystal should show mechanical hysteresis. Other 
obstacles such as mosaic boundaries, oxide films, and stacking faults, 
may act in the same way and much of the following discussion can also 
be applied to them. 

For simplicity we suppose that the applied stress is only sufficient 
to produce slip in a few favourably oriented grains, although the 
theory can also be applied to larger deformations if attention is con- 
centrated on the excess of slip in one grain over that in its neighbours. 
When the proportion of plastic grains is small it is fair to assume that 
each of these behaves as if it were mounted in an elastic matrix. Under 
these circumstances the maximum slip it can undergo is that which 
relaxes the stress in it from the initial value $\sigma$ to some lower one $\sigma_i$,
which is the critical stress below which dislocations are no longer
mobile. If the linear dimension of the grain is $L$ the total slip displace-
ment in it is of order $L(\sigma - \sigma_i)/\mu$, and if the proportion of such grains
in the specimen is $\alpha$ the overall strain produced by slip in them is of
the order

$$\alpha(\sigma - \sigma_i)/\mu, \tag{30}$$

when $\alpha \ll 1$.





The stress $\sigma_i$ takes account of the various obstacles, such as foreign
atoms and precipitated particles, that a dislocation has to overcome
when gliding through its slip plane. We can regard this stress as acting
like a frictional force which prevents a dislocation from moving in either
direction unless the applied stress is greater than it.

This model has been used by NABARRO in the following way to
discuss mechanical hysteresis.*° Suppose that the applied stress is
increased from zero towards a value $\sigma$ which exceeds $\sigma_i$ in a few of the
grains. At first the deformation is purely elastic. When the stress
reaches $\sigma_i$ dislocations begin to move and pile up at the boundaries of
the plastic grains, their mutual repulsion producing a back stress $\sigma_b$.
At the applied stress $\sigma$ dislocations move up to the boundaries until the
back stress acting on each of them reaches the value given by
$\sigma_b = \sigma - \sigma_i$. Plastic flow then stops. On unloading no plastic flow can
occur until the applied stress is reduced by $2\sigma_i$, so that the unloading
curve starts off as a straight line. When the applied stress is reduced
to $\sigma - 2\sigma_i$ the dislocations begin to move backwards into the centre of
the grain, giving a reversed plastic flow. This becomes possible because
the actual stress acting in this direction on each one of them is now
$\sigma_b - (\sigma - 2\sigma_i)$ which, since $\sigma - \sigma_i = \sigma_b$, is equal to $\sigma_i$.

When $\sigma > 2\sigma_i$ the applied stress $\sigma - 2\sigma_i$ is positive, so that reversed
plastic flow begins before the specimen has been fully unloaded.
On the other hand, when $\sigma < 2\sigma_i$ the specimen unloads completely
without undergoing reversed plastic flow. Two effects can then be
observed. If the applied stress is taken through zero and increased in
the reverse direction, the elastic limit at which flow begins in this
direction is $2\sigma_i - \sigma$, which is numerically smaller than the original
elastic limit, $\sigma_i$. This is the Bauschinger effect. Instead of loading in the
reverse direction, the specimen can be rested in the unloaded state.
The stress $\sigma_b$ ($= \sigma - \sigma_i$) is then not strong enough to overcome the
frictional stress $\sigma_i$ due to the obstacles in the slip plane, but even so the
dislocations may occasionally surmount these obstacles with the aid of
thermal agitation. This is an example of the elastic after-effect.
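
The bookkeeping of this frictional model is easily put into a few lines of code. The sketch below is illustrative only: the function and key names are our own, the frictional stress is written `sigma_i` and the peak applied stress `sigma`, and stresses are counted positive in the original loading direction.

```python
def unloading_summary(sigma, sigma_i):
    """Summarise unloading from a peak stress sigma in the frictional model.

    sigma   : peak applied stress reached on loading
    sigma_i : frictional (critical) stress for dislocation motion
    """
    if sigma < sigma_i:
        raise ValueError("no plastic flow occurred: sigma must be at least sigma_i")
    back_stress = sigma - sigma_i          # back stress when forward flow stops
    onset = sigma - 2.0 * sigma_i          # applied stress at which reversed flow begins
    reverse_limit = max(0.0, -onset)       # elastic limit on reversed loading (Bauschinger)
    return {
        "back_stress": back_stress,
        "reversed_flow_onset": onset,
        "reverse_elastic_limit": reverse_limit,
        "reversed_flow_before_unloaded": onset > 0.0,
    }

if __name__ == "__main__":
    print(unloading_summary(1.5, 1.0))  # sigma < 2*sigma_i: unloads fully, softened in reverse
    print(unloading_summary(3.0, 1.0))  # sigma > 2*sigma_i: reversed flow during unloading
```

For a peak stress of 1.5 with a frictional stress of 1.0 the reverse elastic limit is 0.5, smaller than the original limit of 1.0, which is the Bauschinger effect in this model.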

KUHLMANN™ has shown that this kind of model can be used to
explain why the residual strain in the after-effect changes logarith-
mically with time. The stress $\sigma_b$ depends on the number $n$ of dislocations
piled against the boundary. The activation energy for the return of a
dislocation is a function of $\sigma_i - \sigma_b$ and hence of $n$; for small changes in
$n$ it can be written in the form $W - \beta n$, where $W$ and $\beta$ are constants.
The rate of change of $n$ due to the movement of dislocations back into
the grains is $dn/dt = -A \exp \{-(W - \beta n)/kT\}$, where $A$ is a constant.
This integrates to give

$$n = n_0 - (kT/\beta) \log (1 + t/t_0), \tag{31}$$

where $n_0$ is the initial value of $n$ and $t_0$ is defined by

$$n_0 = W/\beta - (kT/\beta) \log (\beta A t_0/kT).$$

The residual strain is proportional to $n$ and hence depends logarithmi-
cally on time.
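
The logarithmic law can be verified by integrating the rate equation directly. The sketch below compares a fourth-order Runge-Kutta integration of $dn/dt = -A \exp\{-(W - \beta n)/kT\}$ with the closed form; the parameter values are arbitrary illustrative numbers in units where $kT = 1$, and are not taken from any experiment.

```python
import math

def closed_form_n(t, n0, W, beta, A, kT):
    """n(t) = n0 - (kT/beta) * log(1 + t/t0), with t0 fixed by the initial value n0."""
    t0 = (kT / (beta * A)) * math.exp((W - beta * n0) / kT)
    return n0 - (kT / beta) * math.log(1 + t / t0)

def integrate_n(t_end, n0, W, beta, A, kT, steps=10000):
    """Integrate dn/dt = -A * exp(-(W - beta*n)/kT) with classical Runge-Kutta steps."""
    def rate(n):
        return -A * math.exp(-(W - beta * n) / kT)
    n = n0
    dt = t_end / steps
    for _ in range(steps):
        k1 = rate(n)
        k2 = rate(n + 0.5 * dt * k1)
        k3 = rate(n + 0.5 * dt * k2)
        k4 = rate(n + dt * k3)
        n += dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6
    return n

if __name__ == "__main__":
    params = dict(n0=30.0, W=20.0, beta=0.5, A=1.0, kT=1.0)  # illustrative values only
    print(integrate_n(1000.0, **params), closed_form_n(1000.0, **params))
```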






DISLOCATION JOGS AND THE CREATION OF VACANCIES 






We turn now to the third kind of interaction involving dislocations, the 
geometrical interaction which occurs when two non-parallel dislocation 
lines cut through each other. HEIDENREICH and SHOCKLEY” pointed
out that this would be a source of hardening in crystals. 

Consider two straight dislocation lines A and B, as in Fig. 17 (a).
Suppose that dislocation A glides in its plane and cuts through B, as
in diagram (b). Then on each dislocation a jog will be formed, at P and
Q. For instance, the part of B above the slip plane of A is moved, being
carried along in the top half-crystal, relative to the part below the plane
when the dislocation A passes through. The direction and magnitude
of this displacement are equal to the Burgers vector of dislocation A.
Since the Burgers vector must be constant along a single dislocation
line, and since this line cannot end within the crystal (T.D.1, page 80),
the displaced ends of dislocation B are necessarily connected by a
short piece of dislocation, the jog, which lies in the slip plane of A and
has the same Burgers vector as B. To summarize, a jog is a dislocation
with the same Burgers vector as the dislocation line which contains it,
while the length and direction of the line of the jog are those of the
Burgers vector of the dislocation that has cut through.

Fig. 17. The formation of dislocation jogs

A jog will usually be one atom long. Now the energy of a dislocation
is about $\mu b^3$ per atom plane (T.D.1, page 116), so that the energy of a
jog is $\alpha\mu b^3$, where $\alpha \approx 1$. To calculate the stress needed to make a
jog consider the situation shown in Fig. 18, where a dislocation line is
about to cut through a row of equidistant dislocations spaced $l$ apart.
If this dislocation moves forward a distance of, say, $b$, i.e. one atom
spacing, it cuts through all the intersecting dislocations and makes $1/l$
jogs in unit length. The dislocation energy is increased by $\alpha\mu b^3/l$,
but the work gained from the applied stress $\sigma$ is $\sigma b^2$. Equating these,
the stress needed to make the jogs is

$$\alpha\mu b/l, \tag{32}$$

so that if the density of intersecting dislocations is $\rho$, the stress needed
to drive a dislocation through them is

$$\alpha\mu b\rho^{1/2}. \tag{33}$$


Taking $\rho = 10^{12}$ cm$^{-2}$ for a heavily worked material (T.D.1, page 104),
and $b = 3 \times 10^{-8}$ cm, this is $\alpha\mu/50$. With $\alpha = 1$ (which may be an
overestimate) this stress is about ten times greater than that predicted
from Taylor’s theory of work hardening (T.D.1, page 106). Taylor’s
theory also leads to the relation (33) for the stress, but with an $\alpha$ of the
order of 0.1.

Fig. 18. A dislocation cutting through a row of dislocations

If the applied stress is less than $\alpha\mu b/l$ the dislocation can still go
forward with the aid of thermal fluctuations. The activation energy to
make a jog is

$$U = \alpha\mu b^3 - \sigma b^2 l. \tag{34}$$

There are two points about this: (a) since $\alpha\mu b^3$ is only a few electron
volts (T.D.1, page 117) the activation energy is always fairly small,
even at low stresses, so that there should be an appreciable temperature
dependence for the hardening caused by intersecting dislocations; (b)
the activation energy varies linearly with stress, a relation that is
needed to explain some creep results described later; NABARRO** has
shown that hardly any other process involving dislocations can give 
this linear relation. The significance of these effects for work hardening 
and creep will be discussed later. 
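
The orders of magnitude in this argument can be reproduced as follows. This is a rough sketch only: the shear modulus of $4 \times 10^{11}$ dyne cm$^{-2}$ is an assumed, typical value not given in the text, the spacing $l$ is taken as $\rho^{-1/2}$, and the function names are illustrative.

```python
import math

EV_IN_ERG = 1.602e-12  # one electron volt expressed in ergs

def jog_cutting_stress_over_mu(b_cm, rho_cm2, alpha=1.0):
    """Stress to cut through intersecting dislocations, in units of mu: alpha*b*sqrt(rho)."""
    return alpha * b_cm * math.sqrt(rho_cm2)

def jog_activation_energy_ev(sigma, mu, b_cm, l_cm, alpha=1.0):
    """Activation energy U = alpha*mu*b^3 - sigma*b^2*l in eV (zero once the stress suffices)."""
    u_erg = alpha * mu * b_cm**3 - sigma * b_cm**2 * l_cm
    return max(u_erg, 0.0) / EV_IN_ERG

if __name__ == "__main__":
    b, rho = 3e-8, 1e12          # cm and cm^-2, as quoted for heavily worked material
    mu = 4e11                    # dyne cm^-2, assumed typical shear modulus
    l = 1.0 / math.sqrt(rho)     # assumed spacing of the intersecting dislocations
    print(jog_cutting_stress_over_mu(b, rho))          # a few hundredths of mu
    print(jog_activation_energy_ev(0.0, mu, b, l))     # a few electron volts at zero stress
```

With these figures the cutting stress comes out at a few hundredths of $\mu$, the same order as the estimate quoted above, and the activation energy falls linearly from a few electron volts at zero stress to zero at the stress $\alpha\mu b/l$.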

We must now consider whether a jog is an impediment to the move- 
ment of the dislocation containing it. Jogs in edge dislocations should 
not have much effect. The interesting case, first pointed out by 
THORNTON READ and later discussed by SEITZ**: *° and MOTT,* is that
where two screw dislocations cross each other. If a screw dislocation 
contains a jog that is perpendicular to the screw axis then this jog is 
a piece of edge dislocation, since its line and its Burgers vector are 
perpendicular to each other. These two directions define a plane, the 
slip plane of the edge dislocation, in which it can glide easily. But this 
implies a movement of the jog along the screw axis and not a movement
with the screw dislocation itself. Any movement of the screw, with the
jog, in the direction perpendicular to the slip plane of the jog, involves 
either lengthening or shortening the half-plane (T.D.1, page 86) 
belonging to the edge dislocation in the jog; this can only occur by the 
jog leaving behind itself a row of vacant atomic sites, or of atoms 
misplaced into interstitial positions, as it moves along with the screw. 
If the temperature is high enough for self-diffusion, or if the jog is 
made with sudden violence, these vacancies or interstitial atoms will be 
dispersed into the surrounding crystal. 

The purely geometrical basis of this effect can be understood with the 
aid of Fig. 19. Suppose in diagram (a) that slip is to occur along the
lines AB and CD. Normally one would slip before the other, as in
diagram (b), where AB has slipped before CD. However, if they both
slip at the same time, and in the directions shown in diagram (c), a
hole must be formed at the centre, where they cross. The essential
condition to make the slip processes on these two planes overlap in
time is that they occur by the movement of screw dislocations, one in
each plane, in the crystal at the same time.

Fig. 19. Forming vacancies by slip on two planes

The generation of vacancies or interstitial atoms from a jog pulled 
along with a dislocation is of interest, and SEITZ**: ** has discussed this
and some other processes for creating these defects. A jog in an edge 
dislocation may occasionally create a defect by a process illustrated 
in Fig. 20. Here CDE represents an edge dislocation with a jog at D, 
the half-plane being ABCDE, moving in the direction shown. The 
line CD can be regarded as an incomplete row of atoms attached to the 
lower edge of an otherwise perfect half-plane. The end of the row D is 
clearly a place where an atom is likely to become detached from the 
half-plane and left behind in an interstitial position as the dislocation 
moves on, or alternatively is picked up from the lattice and added to 
the row, in which case a vacant site is left behind. In close-packed 
metal lattices, where an interstitial atom has more energy than a 
vacancy, it is likely that the second of these processes will operate 
mostly, so that the incomplete row CD will gradually become longer and 
most of the lattice defects created will be vacancies. 

Fig. 20. A jog D in an edge dislocation CE

SEITZ**. *° and MOTT® have summarized the experimental evidence
for the generation of lattice defects by cold work. The main items are
as follows:

1. GYULAI and HARTLY® showed that the electrical conductivity
of sodium chloride is increased about 100 times after 10 per cent
plastic strain. This conductivity is believed to be due to
vacancies and decreases on subsequent ageing at a rate consis-
tent with this hypothesis.

2. MOLENAAR and AARTS® strained polycrystals of copper, silver,
and aluminium, at the temperature of liquid air and measured
the increase of resistivity caused by this. Ageing at room
temperature removed nearly one-half of this increase without
altering the stress-strain curve. SEITZ believes that this
recoverable part of the resistance is due to vacancies created
during the cold working process, and that it disappears when
these condense into aggregates or at jogs in edge dislocations.
From the magnitude of the resistance change he estimates that
10 per cent plastic strain creates about 10"* vacancies per c.c. 

3. DUGDALE*’ has shown that the resistivity of platinum is increased
about 0.5 per cent by fast neutron bombardment, an effect
which is almost certainly caused by the creation of vacancies
and interstitial atoms. This recovers on annealing at the same
rate as, and with the same activation energy (1.1 eV) as, the
recoverable resistance caused by cold work.


THEORY OF THE YIELD POINT AND STRAIN AGEING 
It was shown in T.D.1 that the stresses round a dislocation can be
relieved by a suitable arrangement of nearby solute atoms, and that a
solute atom with polar co-ordinates $r$, $\theta$ relative to a positive edge
dislocation has an interaction energy

$$V = A \sin\theta/r \tag{35}$$

with it. Here $A$ is a constant that depends on the elastic properties
of the material, on the degree of misfit of the solute atom, and on the
strength of the dislocation. This formula breaks down at the centre of
the dislocation, where Hooke’s law does not hold, and it is expected that
a large solute atom is most strongly bound at the position $\theta = 3\pi/2$ and
$r\ (= r_0) \approx 2 \times 10^{-8}$ cm which lies immediately below the half-plane
of the dislocation. For carbon and nitrogen in $\alpha$-iron one obtains*
$A \approx 3 \times 10^{-20}$ dyne cm$^2$ and, at the position of greatest binding,
$V\ (= V_{max}) \approx 1$ eV.

An approximate experimental value can be deduced from the work of
DIJKSTRA.” Using an internal friction method developed by SNOEK”
he measured the solubility of nitrogen in iron in the presence of numer-
ous dislocations introduced by cold work. At 300°C this was 0.0025
weight per cent as compared with 0.012 weight per cent for unworked
material, this latter being the solubility in the presence of iron nitride
particles. According to DIJKSTRA the binding energy of nitrogen in iron
nitride is 0.35 eV, so that $V_{max}$ in dislocations is somewhat greater than
this, say about 0.5 eV. Another method of estimating $V_{max}$ which also
gives 0.5 eV is discussed below.

This interaction can lead to two kinds of segregations of solute atoms
round dislocations. The first is the dilute “atmosphere” or Maxwellian
distribution, defined by

$$c = c_0 \exp (-V/kT), \tag{36}$$

where $c$ is the concentration in a region with a binding energy $V$ and $c_0$
is the overall concentration. This expression is valid when $c$ is small,
so that the volume change produced by the segregation is small com- 
pared with that produced by the dislocation. As the temperature falls 
this atmosphere becomes more concentrated and below a critical tem- 
perature range it condenses into a line of solute atoms, lying along the 
dislocation line at the position of maximum binding.* 

An estimate of the temperature $T_c$ below which condensation should
occur (if diffusion is still possible) can be made as follows.** Suppose
that there is no shortage of solute atoms for dislocations, i.e. that the
number $c_0 N$ in unit volume of the crystal is large compared with $\rho/b$,
the number of sites to be filled along the lines of maximum binding
in the dislocations; here $N$ is the total number of atoms in unit volume
of crystal, $\rho$ is the density of dislocations, and $b$ is the atomic spacing.
Since $\rho/bN = \rho b^2$, which is about $10^{-3}$ when $\rho = 10^{12}$ cm$^{-2}$ and $10^{-7}$
when $\rho = 10^8$ cm$^{-2}$, this condition is satisfied except in the most
dilute solutions. Then under these circumstances equation (36) can be
used to estimate $T_c$. At the point of maximum binding we have
$V = V_{max}$ and require that the temperature $T_c$ should be such as to
give $c = 1$ at this point. Hence


$$T_c = \frac{V_{max}}{k \log (1/c_0)}. \tag{37}$$

MOTT has used this to estimate $V_{max}$. He argues that the yield point
of iron is not observed above about 700°K, and that this temperature
should be identified with $T_c$ since a condensed atmosphere is needed
for a strong yield point;** then by taking $c_0 \approx 10^{-4}$ he concludes that
$V_{max} = 0.5$ eV for carbon in iron.
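
The arithmetic of this estimate is simple enough to check directly. In the sketch below the concentration $c_0 = 10^{-4}$ is an assumed reading of the value used in the argument, the critical temperature is written `t_c`, and the function names are illustrative.

```python
import math

K_BOLTZMANN_EV = 8.617e-5  # Boltzmann's constant in eV per degree

def condensation_temperature(v_max_ev, c0):
    """Temperature at which c reaches unity at the site of maximum binding."""
    return v_max_ev / (K_BOLTZMANN_EV * math.log(1.0 / c0))

def binding_energy_from_temperature(t_c, c0):
    """Inverse relation: the binding energy implied by an observed critical temperature."""
    return K_BOLTZMANN_EV * t_c * math.log(1.0 / c0)

if __name__ == "__main__":
    print(binding_energy_from_temperature(700.0, 1e-4))  # eV, for the 700 degree limit
    print(condensation_temperature(0.5, 1e-4))           # degrees K, for 0.5 eV
```

Because $c_0$ enters only through its logarithm, the estimate of the binding energy is not sensitive to the exact concentration assumed.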

The stress needed to pull a dislocation away from its atmosphere 
depends on whether this is condensed or not. With a condensed one the 
interaction energy per unit length, for a dislocation moved a distance 
x from its line of solute, is 


$$V(x) = -A r_0/(x^2 + r_0^2). \tag{38}$$

The corresponding force on the dislocation, $-dV/dx$, has a maximum
at $x = r_0/\sqrt{3}$ equivalent to a (tensile) yield stress

$$\sigma_0 = A/b^2 r_0^2. \tag{39}$$

For carbon and nitrogen in iron this is $6 \times 10^{10}$ dyne cm$^{-2}$. An extra-
polation®* of observed strengths to 0°K gives $1.25 \times 10^{10}$ dyne cm$^{-2}$.
It is expected that the observed value should be lower, since yielding
in iron is known to begin at places of high stress concentration, and to
spread from these through the rest of the specimen in the form of
Lüders bands.
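
The position of the maximum restoring force can be confirmed numerically. The sketch below assumes the interaction energy $V(x) = -A r_0/(x^2 + r_0^2)$, works in units where $A = r_0 = 1$, and simply scans a grid of displacements; the function names are illustrative.

```python
import math

def restoring_force(x, A=1.0, r0=1.0):
    """Magnitude of dV/dx for V(x) = -A*r0/(x**2 + r0**2)."""
    return 2.0 * A * r0 * x / (x**2 + r0**2) ** 2

def locate_maximum(r0=1.0, samples=20001, x_max_factor=5.0):
    """Scan 0 <= x <= x_max_factor*r0 and return (x, force) at the largest force found."""
    best_x, best_f = 0.0, 0.0
    for i in range(samples):
        x = x_max_factor * r0 * i / (samples - 1)
        f = restoring_force(x, 1.0, r0)
        if f > best_f:
            best_x, best_f = x, f
    return best_x, best_f

if __name__ == "__main__":
    x_star, f_star = locate_maximum()
    print(x_star, 1.0 / math.sqrt(3.0))            # both close to 0.577
    print(f_star, 3.0 * math.sqrt(3.0) / 8.0)      # both close to 0.650, in units of A/r0^2
```

The peak force is about $0.65 A/r_0^2$ per solute atom, so that the numerical factor in the yield stress is of order unity.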

A rough estimate of the stress needed to pull a dislocation away 
from a Maxwellian distribution can be made as follows. The quantity 
$A/kT$ ($= l$) has the dimension of length and, since the density of the
atmosphere is $\exp [-(l/r) \sin\theta]$, can be regarded crudely as the “radius”
of the atmosphere. We expect that the force on the dislocation from its
atmosphere is a maximum when the dislocation is displaced by a dis-
tance of about $l$ from the centre. All the solute atoms in an area of
order $l^2$ are then pulling it back. The force from one of these is $A/r^2$, and
we shall take an average value $l/2$ for $r$. The number of solute atoms in
the area is of order $l^2 c_0 N \exp (l/r) \approx 7 l^2 c_0 N$, per unit length of dis-
location. The critical stress for the breakaway is then

$$\sigma_0 = (1/b)(4A/l^2)(7 l^2 c_0 N) \approx 28 A c_0 N/b. \tag{40}$$

A more accurate calculation” leads to the same result as this, but with
a numerical factor of 17 in place of 28. Since $N$ is of order $1/b^3$, or
$1/b r_0^2$, this critical stress is essentially the same as that of equation (39),
but multiplied by a factor of order $10 c_0$. The presence of $c_0$ means that
in dilute solutions the yield stress for a Maxwellian atmosphere is far
smaller than that for a condensed one. It is the absence of $c_0$ in equation
(39) that makes the yield point for a condensed atmosphere so strong
even in very dilute solutions. We suppose that the rather exceptional
behaviour of iron has its origin here; that room temperature is well
below $T_c$ in this case but above $T_c$ for those metals which do not show
the yield phenomenon.
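
The comparison between the two kinds of atmosphere can be made explicit. The sketch below assumes $N = 1/b r_0^2$, takes $A = 3 \times 10^{-20}$ dyne cm$^2$ and $r_0 = 2 \times 10^{-8}$ cm, and uses an assumed $b = 2.5 \times 10^{-8}$ cm and an assumed concentration of $10^{-4}$; the function names are illustrative.

```python
import math

K_BOLTZMANN_ERG = 1.381e-16  # Boltzmann's constant in erg per degree

def atmosphere_radius_cm(A, T):
    """The length l = A/kT, regarded crudely as the radius of the atmosphere."""
    return A / (K_BOLTZMANN_ERG * T)

def condensed_breakaway_stress(A, b, r0):
    """Yield stress for a condensed atmosphere, A/(b^2 r0^2)."""
    return A / (b**2 * r0**2)

def maxwellian_breakaway_stress(A, c0, N, b, factor=28.0):
    """Rough yield stress for a dilute atmosphere, factor * A * c0 * N / b."""
    return factor * A * c0 * N / b

if __name__ == "__main__":
    A, b, r0, c0 = 3e-20, 2.5e-8, 2e-8, 1e-4   # b and c0 are assumed values
    N = 1.0 / (b * r0**2)                      # assumed number of atoms per unit volume
    ratio = maxwellian_breakaway_stress(A, c0, N, b) / condensed_breakaway_stress(A, b, r0)
    print(atmosphere_radius_cm(A, 300.0))      # of order 1e-6 cm at room temperature
    print(ratio)                               # equals 28*c0 with this choice of N
```

With the more accurate factor of 17 the ratio becomes $17 c_0$, so that in a solution of concentration $10^{-4}$ the dilute atmosphere anchors the dislocation several hundred times less strongly than the condensed one.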

A feature of the theory is that the yield point should depend strongly 
on temperature. There are three effects : 

(a) As the temperature is increased through the range of $T_c$ the yield
stress changes from the value given in equation (39) to that in
equation (40).

(b) Because the atmosphere, and particularly the condensed one, is
highly localized round the dislocation, only a small displacement of a
small length of the dislocation line is needed to create a nucleus of
freed dislocation sufficient to enable the applied stress to drag away
the rest of the dislocation; the energy needed for such a nucleus is
small enough for thermal fluctuations to make an important contribution
to it.”

(c) LOMER® has pointed out that all the available sites in a condensed
atmosphere are not filled because the configurational entropy is increased
if a few remain empty. These empty sites, which weaken the anchor-
ing of the dislocation by the atmosphere, increase in number as the
temperature is raised.

Fig. 21. Yielding in a single crystal of β-brass. (1) first test;
(2) immediately reloaded; (3) after strain ageing for two
hours at 250°C (after ARDLEY)

Fig. 21 shows an example of the yield phenomenon obtained in a 
single crystal of β-brass grown in nitrogen.” It shows the characteristic
sharp fall from the upper to the lower yield point, the absence of this 
on immediate retesting when the material is over-strained, and the 
return of it on strain ageing. Several examples of materials showing the 
yield phenomenon are now known. Soft polycrystalline iron or mild 
steel, containing a little carbon and nitrogen, is the familiar one. Yield 
points have now been obtained in single crystals of iron containing 
carbon or nitrogen, in some cases during the first test made on them,”
but in others only after strain-ageing treatments have been given.” %
Examples amongst non-ferrous metals are polycrystalline molyb-
denum,** and single crystals of zinc”: 1, cadmium™ 1,10 and
α- and β-brass.* It is often not possible to demonstrate the existence of
a yield point in a soft single crystal by means of the tensile test, because 
of difficulties in avoiding stress concentrations, but it can be proved by 
means of a bend test. 

The fact that, in iron, larger yield points are obtained in poly- 
crystals than in single crystals, and in fine grained polycrystals than in 
coarse grained ones, has led some authorities to suggest that the yield 
point in iron is caused by films of hard material, probably rich in 
carbon, that encase the grains and prevent the soft metal inside them 
from flowing until, at the upper yield point, they break and release the 
plastic flow. The difficulty with this view is to see how it can lead to 
an appreciable fall of stress at the yield point, and why in practice 
there is such a small amount of plastic deformation before the upper 
yield point is reached.!, 1° 

A qualitative explanation can be based on the dislocation theory,’ 
as follows. There are three important stresses to be considered: (a) the
observed upper yield stress $\sigma_u$, (b) the theoretical breakaway stress $\sigma_0$
(at the temperature of the experiment) for an anchored dislocation,
(c) the applied stress $\sigma_0/q$ at which dislocations begin to break away in
regions of severe stress concentration $q$. Several experimental facts can
be correlated’ on the basis that $\sigma_0/q < \sigma_u < \sigma_0$. Yielding begins at a
few places of high stress concentration, before the upper yield point is 
reached, and creates small regions in which plastic flow has begun while 
the rest of the specimen still remains elastic. The spread of these 
nuclei of yield is resisted by obstacles in the path of the freed dislocations 
in them, the most important obstacles being grain boundaries. The 
upper yield is the stress at which one of these nuclei first becomes able 
to spread into the rest of the material. 

Starting from equation (35) for the interaction energy, a calculation 
can be made of the rate at which solute atoms segregate to a free 
dislocation during strain ageing. The expression obtained by COTTRELL
and BILBY® for this is

$$n(t) = \alpha n_0 (ADt/kT)^{2/3}, \tag{41}$$

where $n(t)$ is the number of atoms segregated in time $t$ to unit length
of a dislocation in a region where the initial number of atoms in solution
is $n_0$ per unit volume; $D$ is the diffusion coefficient for these atoms and
$\alpha$ is a number nearly equal to 3. The physical basis of this formula
can be understood as follows. The gradient of the interaction energy
provides a force attracting a solute atom to a dislocation and, migrating
by thermal agitation, the atom acquires a drift velocity $v$ proportional
to it. Since $V$ varies as $r^{-1}$, where $r$ is the distance from the dislocation,
the gradient of $V$ and the velocity $v$ vary as $r^{-2}$. Thus atoms starting
at a distance $r$ from the dislocation take a time $t$ proportional to $r/v$,
i.e. to $r^3$, to reach it. The number that reach the dislocation in a time
less than this is proportional to the initial number at all distances less
than $r$, i.e. is proportional to $r^2$, and hence to $t^{2/3}$.








Fig. 22. The effect of temperature on the rate of strain ageing
in iron containing carbon (after HARPER)


The actual rate of strain ageing should become slower during the 
later stages than equation (41) indicates because the dislocations 
become saturated with solute atoms. HARPER’ made a simple generali-
zation to include such effects by supposing that the rate of segregation
decreases in proportion with the amount segregated. This gives for $f$,
the fraction segregated,

$$f = 1 - \exp [-\alpha\rho (ADt/kT)^{2/3}], \tag{42}$$

where $\rho$ is the density of dislocations. Fig. 22, taken from HARPER’s
internal friction measurements of the strain-ageing process, shows that,
for the segregation of carbon in cold worked iron, $\log (1 - f)$ varies
linearly with $t^{2/3}$ as the theory requires. From the slopes of these lines


the activation energy is found to be 20,000 cal/mol, in agreement with 
the value for diffusion of carbon in ferrite. By fitting equation (42) to 
the experimental results the density of dislocations can be found. The 
values HARPER obtained, e.g. 2-5 x 10" cm~ after 10 per cent strain,
agree with those of COTTRELL and CHURCHMAN,”” who followed the
strain-ageing process by an electrical resistance method.
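
The way in which a dislocation density is extracted from such measurements can be sketched as follows. The example generates synthetic values of $f$ from the saturating law $f = 1 - \exp[-\alpha\rho(ADt/kT)^{2/3}]$, fits a straight line through the origin to $\log (1 - f)$ against $t^{2/3}$, and recovers $\rho$ from the slope. The diffusion coefficient and the dislocation density used here are purely illustrative assumptions, not measured values, and the function names are our own.

```python
import math

K_BOLTZMANN_ERG = 1.381e-16  # Boltzmann's constant in erg per degree

def fraction_segregated(t, rho, A, D, T, alpha=3.0):
    """Saturating law f = 1 - exp(-alpha * rho * (A*D*t/kT)**(2/3))."""
    return 1.0 - math.exp(-alpha * rho * (A * D * t / (K_BOLTZMANN_ERG * T)) ** (2.0 / 3.0))

def slope_through_origin(xs, ys):
    """Least-squares slope of a straight line constrained to pass through the origin."""
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)

def density_from_slope(slope, A, D, T, alpha=3.0):
    """Dislocation density from the slope of log(1 - f) plotted against t**(2/3)."""
    return -slope / (alpha * (A * D / (K_BOLTZMANN_ERG * T)) ** (2.0 / 3.0))

if __name__ == "__main__":
    A, D, T = 3e-20, 1e-17, 300.0   # D in cm^2/s is an assumed illustrative value
    rho_true = 1e11                 # cm^-2, assumed for the synthetic data
    times = [1e3, 1e4, 5e4, 1e5, 2e5]
    xs = [t ** (2.0 / 3.0) for t in times]
    ys = [math.log(1.0 - fraction_segregated(t, rho_true, A, D, T)) for t in times]
    print(density_from_slope(slope_through_origin(xs, ys), A, D, T))
```

Repeating the fit at several temperatures gives the temperature dependence of the slope, from which the activation energy for diffusion follows.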

These experiments show that the amount of carbon or nitrogen 
transferred to dislocations during strain ageing in moderately worked 
iron is of the order of 10-* to 10-* weight per cent. But the equilibrium 
solubility of these elements at room temperature is much smaller than 
this, of the order of 10~-’ weight per cent for carbon and 10-° weight per 
cent for nitrogen.*”.™° This means that either the solution is super- 
saturated when the strain-ageing experiments are made or that some 
particles of carbide and nitride redissolve in order to feed the disloca- 
tions. Probably both of these occur in practice. It is known*®*: 4° that 
precipitation in unworked iron becomes extremely slow when the 
amount left in solution is no more than 10-* weight per cent, and that 
the metal remains supersaturated to this extent even after one month at 
room temperature. This is probably a result of the higher solution 
pressure of finely dispersed precipitates. A recent study™ of these 
effects has shown that heat treatments designed to make it difficult for 
dislocations to obtain carbon and nitrogen (e.g. prolonged ageing before 
straining in order to make the precipitates coarse and reduce the super- 
saturation) can slow down the rate of strain ageing by an order of 
magnitude. It was found that the initial rate was closely proportional 
to the concentration of carbon and nitrogen in solution, as equation (41) 


requires. 


WORK HARDENING


To construct a complete theory of work hardening three steps are
necessary:

1. The adoption of some model to represent the structure of the cold
worked crystal.
2. The calculation of the stress needed to enforce slip in such a
structure.
3. The calculation of the strain needed to produce the structure.

All three have to be taken if the stress-strain curve is to be calculated. The
second step is the only one that is even moderately easy. Most theories 
have failed at the first one because the model adopted has not agreed 
with observed cold-worked structures. So far as the writer is aware, 
no one has ever accomplished the third step without the aid of an ad hoc 
assumption. Early theories, some of which were reviewed in T.D.1, 
achieved comprehensiveness by the adoption of a highly artificial 
model, but later ones have usually preferred to explain less in order to 
keep the model as realistic as possible. 

The simple picture of slip is one of translation of thin parallel sheets 
relative to one another along a fixed direction. On such a picture it 
has always been a puzzle to understand why these sheets should not 
slip right off one another once they have started to slide, i.e. why
there should be any work hardening. The recent tendency is to uphold 
the primitive notion from which this difficulty springs; to believe that 
in this simple “laminar” slip there is in fact little or no work hardening
and that the intense work hardening so often observed in practice is 
caused by more complex or “turbulent” slip processes involving 
inhomogeneous lattice rotations and slip on intersecting planes. 

The main evidence for this view is the striking difference in the work 
hardening of crystals of hexagonal and cubic metals. Many shear 
stress-strain curves of crystals deformed in tension can be approximated 
in the plastic range by the relation 


$$\gamma = (\sigma/h)^{m},$$

where $\sigma$ is the increase in shear stress, above the initial yield value,
needed to produce the shear strain $\gamma$, and where $m = 1$ for hexagonal
metals and 2 for cubic ones. The parameter $h$ is the coefficient of work
hardening; at room temperature it is about 4 x 10° dyne cm$^{-2}$ for
aluminium, 5 x 10’ for magnesium, and 2.5 x 10’ for zinc. The
difference between metals of cubic and hexagonal structures does not 
stop at this. With zinc and cadmium crystals the slip bands all appear 
early in the deformation, whereas with aluminium they increase in 
proportion with the applied stress. In a detailed study of tensile deformation
HONEYCOMBE showed that x-ray Laue asterisms are observed
from aluminium crystals (except those oriented to deform in multiple 
slip) but not from cadmium ones. In these aluminium crystals neither 
the work hardening nor the asterisms could be eliminated except by 
annealing at temperatures where recrystallization occurred. The 
cadmium crystals, however, would not recrystallize and the small 
amount of work hardening that could be introduced in them by tensile 
straining was quickly removed by mild annealing. 
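As a rough numerical illustration of the empirical relation quoted above, the sketch below inverts it to give the stress increase for a given shear strain, and the slope of the curve, for the linear (hexagonal, m = 1) and parabolic (cubic, m = 2) cases. The function names are hypothetical and the coefficient used in the demonstration is an arbitrary unit value, not a measured one.

```python
def stress_increase(gamma, h, m):
    """Stress increase above the initial yield value, from gamma = (sigma/h)**m.

    gamma : shear strain (dimensionless)
    h     : coefficient of work hardening (same units as the returned stress)
    m     : exponent, 1 for hexagonal metals and 2 for cubic ones in the text
    """
    if gamma < 0 or h <= 0 or m <= 0:
        raise ValueError("gamma must be >= 0; h and m must be > 0")
    return h * gamma ** (1.0 / m)


def hardening_rate(gamma, h, m):
    """Slope d(sigma)/d(gamma) of the same curve; requires gamma > 0."""
    if gamma <= 0 or h <= 0 or m <= 0:
        raise ValueError("gamma, h and m must all be > 0")
    return (h / m) * gamma ** (1.0 / m - 1.0)


if __name__ == "__main__":
    h = 1.0  # arbitrary unit coefficient, chosen only to compare curve shapes
    for gamma in (0.01, 0.1, 1.0):
        linear = stress_increase(gamma, h, 1)
        parabolic = stress_increase(gamma, h, 2)
        print(f"gamma={gamma:5.2f}  m=1: {linear:.3f}  m=2: {parabolic:.3f}")
```

With equal coefficients the parabolic curve lies above the linear one for strains below unity, but in practice the two classes of metal also differ in the magnitude of h, as the figures quoted in the text indicate.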

Some of this evidence has been available for many years, but not 
until recently has it seemed feasible to draw the above conclusion from 
it. For it immediately leads to the following difficulty. Admittedly, a 
cubic crystal oriented to deform from the start by double slip may well 
behave differently from one of cadmium where double slip is not 
observed; but it is easy to obtain orientations where cubic crystals 
should deform at the start by single slip, and why do these behave 
differently from the hexagonal ones? TAYLOR (see also references
35 and 114) showed that strain hardening in aluminium is more rapid 
when the crystal deforms from the start in double slip than in single 
slip, but this difference was only about 20 per cent and quite trivial 
when compared with the difference between cubic and hexagonal 
crystals. 

Again, the recent tendency is to uphold the notion upon which the 
difficulty rests; to say that when a cubic crystal is made to deform in
pure single slip it does so like a hexagonal one and shows little hardening. 
However, just as it is difficult to prevent fast laminar flow in a fluid 
from breaking up into turbulent flow, so it is difficult, with a cubic 
crystal, to prevent laminar slip from breaking up into turbulent slip. 
It is not sufficient merely to choose an initial orientation suitable for 
single slip. Other steps are also necessary and if these are not taken the 
crystal generally does not behave in the simple way expected of it. 





Fig. 23. Contrast between behaviour of a silver crystal (1) with a clean
surface but exposed to air for 2 hours at room temperature, and (2) a
similar crystal tested in argon. Both tested at room temperature (after
ANDRADE and HENDERSON). Axes: resolved shear stress (kg wt/mm^2)
against glide (per cent).


The main evidence for this view is the effect known as “easy glide”;
under certain conditions the plastic part of the stress-strain curve of a 
face centred cubic crystal begins with a region where the hardening 
coefficient is abnormally small, or even zero, after which the curve rises 
steeply to give the intense work hardening typical of such crystals. The 
observations of this are as follows: 

(a) ANDRADE and HENDERSON observed such regions in crystals of
pure gold and silver tested in tension, and concluded that easy glide is 
favoured by high purity, complete absence of surface contamination 
(e.g. oxide films), and a low temperature of deformation. Fig. 23 shows 
the remarkable effect that exposure to air has upon the work hardening 
of silver crystals. 

(b) MADDIN, MATHEWSON and HIBBARD showed that some single
crystals of α-brass deform on a single set of slip planes with practically
no hardening until the stage is reached where double slip begins, when
hardening becomes intense. Vacant atomic sites should be produced in
large numbers, by the crossing of screw dislocations, once double slip
begins and there is some indirect evidence for this in the observations of
MASIMA and SACHS, who showed that during single slip the change in
electrical resistance and density was small but became substantial when
double slip began.

(c) Crystals of aluminium stretched in tension show easy glide pro-
vided they are pure and have orientations which avoid double slip.
Fig. 24, taken from the work of LÜCKE and LANGE, shows stress-strain
curves on two crystals of pure aluminium (99.99 per cent), one of which
(A) deformed initially by single slip and showed a region of easy glide
in which hardening was very small, while the other (B) deformed by
multiple slip and began hardening immediately. MASING and RAFFELSIEPER
showed that the region of easy glide became smaller the
nearer the initial orientation was to one giving multiple slip, and that
it was suppressed entirely in crystals of less pure aluminium (99.5 per
cent); these gave roughly parabolic curves and intense work hardening
for all initial orientations.

Fig. 24. Stress-strain curves on pure aluminium crystals
(after LÜCKE and LANGE)

(d) In an attempt to avoid plastic bending at the grips of a specimen,
RÖHM and KOCHENDÖRFER strained long crystals of pure aluminium
(99.99 per cent) in a special shearing apparatus. They found that this
produced simple laminar slip which resembled slip in hexagonal metals
in that it gave a linear stress-strain relation, a small work hardening
which could be removed easily by mild annealing, and no x-ray 
asterisms. 

(e) PRATT studied the deformation of sodium chloride crystals by
means of special optical methods. Two kinds of stress-strain curves 
were obtained, steeply rising parabolic ones on crystals that deformed 
throughout on intersecting slip systems, and less steep and roughly 
linear ones on crystals in which, after a small initial period of double 
slip, only one slip system was active. 

The impression gained from these results is that a cubic crystal 
oriented for single slip will start to deform in this ideal manner and give 
little work hardening, as would be expected. However, sooner or later, 
something happens that breaks up the flow into a more turbulent kind 
that gives intense work hardening. By taking certain precautions one 
may delay the onset of this second stage, or even postpone it, but if these 
are not taken it begins so soon that the “easy glide” characteristic of
ideal single slip is not observed. In some cases the onset of turbulent
flow is delayed until double slip clearly begins (α-brass) but this is not
true of aluminium stretched in tension, where it usually begins within
the first few per cent extension; one cannot be sure, of course, that
small amounts of slip on “unpredicted” planes do not occur during this
stage. However, the structural feature which appears to be connected
with the change in the nature of the flow in this case is the deformation
band or kink band. HONEYCOMBE showed that these begin to appear
after 1 per cent extension in crystals that deform by single slip, but not 
in those that deform by double slip. 

Detailed studies of deformation bands have been made by CAHN,
HONEYCOMBE, CHEN and MATHEWSON, and LaLogvur and CRUSSARD.
Fig. 25 is an example of a band. It consists of a thin sheet
extending across the crystal. At small strains the sheet is plane and 
normal to the active slip direction. It forms an obstacle to slip and 
many slip bands fail to penetrate it, the piled up dislocations in them 
producing the characteristic double bend of the lattice illustrated in 
Fig. 26. The intense stresses from these cause slip to occur on other slip 
systems in the band, and these intersecting slip bands become clearly 
visible after strains of about 20 per cent. The spacing between deforma- 
tion bands is commonly about 10-* to 10-'cm and the change in 
orientation across a band is sufficient to account for the x-ray Laue 
asterisms observed from crystals containing bands. Crystals which 
deform entirely in multiple slip develop the bands either not at all or 
on too fine a scale to be seen. 

The process by which the bands form is still obscure. The two main 
views are that they start from bend planes which run into the crystal 
from the ends, having been formed by the lattice bending caused by 
grip constraints,*’, 4*, 1,121 or that they grow by the successive
trapping of dislocations moving in slip planes near to one containing 
an obstacle against which dislocations have piled up.™. ** Deformation 
bands bear a geometrical resemblance to the kinks produced in hexa- 
gonal metal crystals by compression along an axis in the basal plane. 
FRANK has suggested that macroscopic kinks can grow from short ones
because the stress concentration at the end of an incomplete wall of edge
dislocations can rise sufficiently to create new dislocations in the perfect
lattice beyond it. A wall of edge dislocations which extends part of the
way through a crystal is equivalent to a wedge, of atomic sharpness at
its edge and composed of the half-planes of these dislocations, which is
driven into the crystal.

Fig. 26. Piling of dislocations against a deformation band (a) before
and (b) after polygonization

The above discussion points to the conclusion that strong work 
hardening in cubic metals originates in the interactions among dis- 
locations on intersecting slip bands. The arguments for this are as 
follows:

(a) This kind of hardening is only observed in crystals capable of
slipping on intersecting planes.

(b) It is only observed in these crystals under circumstances that
favour slip on intersecting systems, i.e. when the orientation is suitable 
for multiple slip, or when deformation bands are present. 

(c) It does not seem possible to achieve the stability which the rela- 
tive permanence of the cold worked state requires by means of purely 
Hookean interactions amongst slip dislocations in parallel planes. A 
dislocation in an array such as those considered by TAYLOR and
KOEHLER (T.D.1, page 105) is at rest only so long as the net shear stress
acting on it is zero. It is difficult to design arrays for which this stress is 
precisely zero on all dislocations in both the presence and absence of 
applied stress, and most arrays that one might expect to form turn out 
on analysis to be mechanically unstable unless some of their dislocations
are permanently fixed in position and not merely allowed to rest because
no shear stress acts on them. It is not necessary that all dislocations
should be anchored, and the elastic after-effect described earlier shows
that some of them are in fact mobile. What is required is that some
dislocations in key positions should become incapable of moving and
then act as obstacles confining the other dislocations, which are still
mobile, to move within restricted areas of their slip planes.

Fig. 25. A deformation band in aluminium: crystal
extended 20 per cent (after CAHN) 250

Two mechanisms involving slip on intersecting planes which would 
provide such locking have been suggested. The first is that of LOMER
and COTTRELL, described earlier, in which slip dislocations are turned
into sessile ones by combination with slip dislocations on intersecting 
planes. The other mechanism, also described earlier, is that which 
leads to the formation of jogs. A dislocation may be able to glide freely 
in one slip plane but nevertheless will be an obstacle to dislocations 
moving on intersecting planes if, to be carried along with these, it has to 
glide out of its plane. A row of such dislocations, as shown in Fig. 18, 
holding up slip in an active plane provides a suitable obstacle for work 
hardening. Furthermore, as we have noted earlier, the hardening 
caused by such obstacles is sensitive to temperature, and this is observed 
in practice. The density of intersecting dislocations needed to account 
for the observed hardening in heavily strained crystals is in the range 
10° to 10% cm-*, and there is some independent evidence for such 
densities (T.D.1, page 104, also references 108 and 109). 

Hardening due to the mutual interference of dislocations in different 
slip planes should, of course, be most severe in crystals oriented for 
multiple slip. However, it should not be thought that this kind of 
hardening is absent in a crystal which shows only one set of visible slip 
bands. What matters is the density of dislocations in intersecting slip 
planes and this is not the same as the total amount of slip on these planes. 
This point of view is brought out in a theory which is presented below.
Even with an orientation suited for double slip there is a tendency to
develop a patchwork grouping of regions in which slip is predominantly
on one plane, as HONEYCOMBE showed with his “bands of secondary
slip.” Such a clustering of parallel slip planes into packets is to be 
expected since the array of similar dislocations built up in a cluster is 
difficult to penetrate by dislocations approaching along a different 
family of slip planes. 

The ideas outlined above provide the elements of a theory of harden- 
ing for turbulent slip, but not for laminar slip, the theory of which has 
still to be developed. Various suggestions have been put forward by 
KUHLMANN, SEITZ, OROWAN, and MOTT which may prove
important for this. An attempt to develop the theory of turbulent
hardening quantitatively has been made by MOTT, whose theory will
now be described. 

The theory is two-dimensional and based on the piling up of edge 
dislocations against obstacles spaced at intervals of $2L$ along their slip
planes. At each obstacle a group of $n$ dislocations of the same sign is
piled up and anchored by combination with a few dislocations on inter-
secting planes which have been forced into activity to relieve the high
stresses round the group. Any slip on secondary planes other than these
small and highly localized movements is not essential to the theory.
Each group acts at large distances like a “super-dislocation” of strength
$nb$, and its stress field reaches $n$ times as far, at the same intensity, as
that of a unit dislocation. This is held to account for the wide spacing
between the slip bands, such as is observed in aluminium.** The picture
of work hardening which emerges is similar to that of TAYLOR (T.D.1)
but with the following main differences: (a) the dislocations are
anchored, (b) the strength of each dislocation is $n$ times larger, (c) the
spacing of them is correspondingly larger, (d) the dislocations are not
necessarily arranged in any regular array. Let the distance between the
active slip planes be $d$. Then the density of the “super-dislocations” is
$(Ld)^{-1}$, and the mean distance between them is $(Ld)^{1/2}$. Each exists in a
field of internal stress due to its neighbours given approximately by


$$\sigma_i = \frac{\mu (nb)}{2\pi (Ld)^{1/2}}. \qquad (44)$$

The plastic strain is $\rho(nb)L$, or, substituting $\rho = (Ld)^{-1}$,

$$\gamma = nb/d. \qquad (45)$$

Eliminating $d$ from these relations gives a parabolic relation between
$\sigma_i$ and $\gamma$,

$$\sigma_i = (\mu/2\pi)(n\gamma b/L)^{1/2}, \qquad (46)$$

similar to that of TAYLOR’s theory (T.D.1, equation (23)).
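The elimination of d between equations (44) and (45) can be checked numerically. The following is a minimal sketch, assuming the reading of the three equations given here (internal stress, plastic strain, and the parabolic relation (46)); the function names and all numerical values are illustrative assumptions in c.g.s. units, not data from the text.

```python
import math


def internal_stress(mu, n, b, L, d):
    """Equation (44): stress on a super-dislocation of strength n*b
    from neighbours at a mean distance sqrt(L*d)."""
    return mu * n * b / (2.0 * math.pi * math.sqrt(L * d))


def plastic_strain(n, b, d):
    """Equation (45): strain rho*(n*b)*L with density rho = 1/(L*d)."""
    return n * b / d


def parabolic_stress(mu, n, b, L, gamma):
    """Equation (46): the result of eliminating d from (44) and (45)."""
    return (mu / (2.0 * math.pi)) * math.sqrt(n * gamma * b / L)


if __name__ == "__main__":
    # Hypothetical values chosen only to exercise the algebra.
    mu, n, b, L, d = 2.7e11, 25, 2.86e-8, 1.0e-3, 1.0e-5
    gamma = plastic_strain(n, b, d)
    direct = internal_stress(mu, n, b, L, d)
    eliminated = parabolic_stress(mu, n, b, L, gamma)
    print(f"gamma = {gamma:.4g}")
    print(f"(44): {direct:.6g} dyne/cm^2   (46): {eliminated:.6g} dyne/cm^2")
```

The two stresses agree for any positive choice of the parameters, which is simply a restatement of the substitution d = nb/γ.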

To eliminate the somewhat arbitrary parameter $L$ in this relation,
MOTT introduces his idea of the dynamic generation of loops at a
Frank-Read source. The first time a source of length $l$ is made to work,
a stress $\Delta\sigma = \mu b/l$ is needed, in addition to the frictional stresses due
to impurities etc., but thereafter the source works spontaneously until
the back-stress on it has cancelled this stress. The number $n$ of dis-
locations needed to do this from a distance $L$ is given by $\Delta\sigma = n\mu b/2\pi L$,
which implies, since $\Delta\sigma \simeq \mu b/l$, that $n/L \simeq 2\pi/l$. Substituting this in
the stress-strain relation gives

$$\sigma_i = \mu(\gamma b/2\pi l)^{1/2}. \qquad (47)$$

MOTT emphasizes three things about this result:

(a) When $l$ is constant it gives a parabolic relation between stress and
strain, as is observed.

(b) When $l$ ~ 10-* cm, which is reasonable, the coefficient of work
hardening is of the right order of magnitude.

(c) The stress-strain relation does not depend on L, the spacing of 
the obstacles. This is important since the intensity of turbulent work 
hardening is roughly the same in crystals of different orientations. 
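Points (a) to (c) can be illustrated with a short calculation. The sketch below assumes the reading of the result as a stress equal to μ(γb/2πl)^(1/2), so that the coefficient of work hardening in the parabolic law is μ(b/2πl)^(1/2). The shear modulus, Burgers vector and source length in the demonstration are assumed round values for an aluminium-like metal in c.g.s. units, not figures taken from the text.

```python
import math


def pileup_number(L, l):
    """Number of dislocations n that cancels the source stress: n/L ~ 2*pi/l."""
    return 2.0 * math.pi * L / l


def stress_with_obstacle_spacing(mu, gamma, b, L, l):
    """Parabolic relation written with the obstacle spacing L still present."""
    n = pileup_number(L, l)
    return (mu / (2.0 * math.pi)) * math.sqrt(n * gamma * b / L)


def mott_stress(mu, gamma, b, l):
    """Final relation, in which L has dropped out."""
    return mu * math.sqrt(gamma * b / (2.0 * math.pi * l))


def hardening_coefficient(mu, b, l):
    """Coefficient h in gamma = (sigma/h)**2 implied by the final relation."""
    return mu * math.sqrt(b / (2.0 * math.pi * l))


if __name__ == "__main__":
    mu = 2.7e11   # assumed shear modulus, dyne/cm^2
    b = 2.86e-8   # assumed Burgers vector, cm
    l = 1.0e-4    # assumed Frank-Read source length, cm
    print(f"h = {hardening_coefficient(mu, b, l):.3g} dyne/cm^2")
    for L in (1.0e-3, 1.0e-2):
        s = stress_with_obstacle_spacing(mu, 0.1, b, L, l)
        print(f"L = {L:g} cm  ->  stress at gamma = 0.1: {s:.4g} dyne/cm^2")
```

Changing L alters the number of dislocations in each pile-up in proportion, so the computed stress is unchanged, which is the content of point (c).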

To account for the dependence of work hardening on temperature,
MOTT takes up an idea due to BROWN, and suggests that vacancies
created in an active slip plane during plastic flow migrate into neigh-
bouring planes, up to 100 atoms away, and soften these sufficiently to
cause them to start slipping. This is also held to account for the stepped
structure of slip bands observed by HEIDENREICH and SHOCKLEY, and
BROWN. However, alternative explanations are possible. We have
seen that the hardening caused by intersecting dislocations is sensitive
to temperature. As regards the formation of steps in slip bands, the
suggestion has been put forward independently by FISHER and the
writer that these are caused by the displacement of outlying parts of
active slip planes, relative to the central ones, as a result of slip on inter-
secting planes. This process is illustrated in Fig. 27. Here a slip band
ABC emerges on the surface at C. At an earlier stage in the deformation
B’C’ was the continuation of the line AB of the band, but this part has
now been moved up by slip on an intersecting plane DE. Since B’C’
is now cut off from the Frank-Read source generating the band no more
slip can take place on this step. This effect of the cessation of slip on
existing steps, when new ones are formed, has been observed by
BROWN, who calls it “sterile slip.”

Fig. 27. Formation of steps on a slip band


ANNEALING 
In T.D.1 a theory was given which explained why polygonization occurs 
in a plastically bent crystal, and why polygon boundaries are perpendi- 
cular to the active slip direction, from the elastic properties of edge 
dislocations. It did not explain how polygonization occurs, however. 
Two points are important here: (a) in the freshly bent crystal the 
dislocations are grouped “horizontally” in slip planes, whereas after
polygonization they are grouped “vertically” in walls; to produce this
change they must climb out of their slip planes besides gliding along
them; (b) polygonization only occurs rapidly at temperatures where
self-diffusion is also rapid. As MOTT has pointed out, this means that
polygonization must occur by the addition or removal of matter at jogs
on the edges of the half-planes by atomic migration. The rate at which
a jog gives off a vacancy can be written as $c \exp(-W/kT)$, where $W$ is
the activation energy for self-diffusion and where the frequency factor
$c$ contains a term proportional to the force pushing the edge dislocation
out of its slip plane. If $n/b$ is the number of jogs per unit length,
where $b$ is the atomic spacing along the edge, the velocity with which the
dislocation climbs away from its slip plane is

$$nbc \exp(-W/kT). \qquad (48)$$


As it climbs, the dislocation has to be replenished with jogs since 
the existing ones run to the end of the line. New jogs can be made by 
the action of thermal fluctuations, in which case the equilibrium density 
of jogs along the edge is given by the relation $n = \exp(-U/kT)$,
where $U$ is the work of formation of a jog; the activation energy for
climbing is then increased to $U + W$, and polygonization cannot occur
rapidly except at high temperatures. Alternatively, the dislocation may
glide slowly under an applied stress in its slip direction, while it is
climbing and so generate its own jogs by cutting through intersecting 
screw dislocations. In this case the activation energy for climbing is 
only W, not U + W, and polygonization occurs at lower temperatures. 
This is perhaps why cell-structures, very reminiscent of complex 
polygonization boundaries, are formed during slow creep in poly- 
crystals at temperatures much lower (e.g. 250°C instead of 600°C) than 
are necessary to produce them by annealing after cold work.* The 
fact that these cell-structures have crystallographically irregular 
boundaries is not an argument against their being formed by poly- 
gonization, since the mutual constraints of neighbouring crystals force 
slip to occur by dislocations with more than one kind of Burgers vector ; 
SHOCKLEY and READ have suggested, and LOMER and NYE have
confirmed with the bubble raft, that the equilibrium orientation of a 
small-angle boundary is always perpendicular to the combined Burgers 
vector of the dislocations in it. 
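The difference between the two ways of supplying jogs can be made concrete with a small calculation of the climb velocity nbc exp(−W/kT). This is a sketch under stated assumptions: the frequency factor c is treated as a constant, the jog concentration is either fixed (jogs made by cutting through screw dislocations) or thermal, n = exp(−U/kT), and the energies and other numbers are hypothetical values chosen only for illustration.

```python
import math

K_B_EV = 8.617e-5  # Boltzmann constant in eV per kelvin


def climb_velocity(n, b, c, W, T):
    """Climb velocity n*b*c*exp(-W/kT) for a jog concentration n per atomic site."""
    return n * b * c * math.exp(-W / (K_B_EV * T))


def thermal_jog_concentration(U, T):
    """Equilibrium jog concentration n = exp(-U/kT) from thermal fluctuations."""
    return math.exp(-U / (K_B_EV * T))


def apparent_activation_energy(velocity, T1, T2):
    """Activation energy (eV) from the Arrhenius slope between two temperatures."""
    slope = (math.log(velocity(T2)) - math.log(velocity(T1))) / (1.0 / T2 - 1.0 / T1)
    return -K_B_EV * slope


if __name__ == "__main__":
    b, c = 2.86e-8, 1.0e13   # assumed spacing (cm) and frequency factor (1/s)
    W, U = 1.3, 0.5          # assumed energies in eV, for illustration only
    fixed = lambda T: climb_velocity(1.0e-2, b, c, W, T)
    thermal = lambda T: climb_velocity(thermal_jog_concentration(U, T), b, c, W, T)
    print("fixed jogs  :", round(apparent_activation_energy(fixed, 500.0, 600.0), 6), "eV")
    print("thermal jogs:", round(apparent_activation_energy(thermal, 500.0, 600.0), 6), "eV")
```

The Arrhenius slope recovers W when the jog concentration is independent of temperature and U + W when the jogs are thermal, in line with the argument in the text.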

Apart from this progress in the theory of polygonization, the develop- 
ment of a structural theory of recovery has been held up while the 
nature of the cold-working process is being clarified. In the meantime, 
a formal theory of recovery has been developed by KUHLMANN
and COTTRELL and AYTEKIN, in which the basic idea is that the rate,
$-d\sigma/dt$, at which the yield stress $\sigma$ recovers on annealing is controlled
by an activation energy, $Q - \beta\sigma$, that depends linearly on the yield
stress. An analysis similar to that used to derive equation (30) then 
shows that the yield stress decreases logarithmically with the annealing 
time, a relation which has been confirmed experimentally.!?’. 128. 19 
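The logarithmic law follows directly from a stress-dependent activation energy. As a hedged sketch, suppose the recovery rate is written −dσ/dt = K exp(aσ), where K collects an assumed constant prefactor together with the stress-independent part of the Boltzmann factor, and a is the stress coefficient of the activation energy divided by kT. These symbols, the function names and the numbers below are assumptions made for illustration. The closed-form solution is then compared with a direct numerical integration.

```python
import math


def yield_stress_analytic(t, sigma0, K, a):
    """Closed-form solution of d(sigma)/dt = -K*exp(a*sigma), sigma(0) = sigma0."""
    return sigma0 - math.log1p(a * K * math.exp(a * sigma0) * t) / a


def yield_stress_numeric(t_end, sigma0, K, a, steps=20000):
    """Same equation integrated with the classical fourth-order Runge-Kutta rule."""
    def rate(s):
        return -K * math.exp(a * s)

    dt = t_end / steps
    s = sigma0
    for _ in range(steps):
        k1 = rate(s)
        k2 = rate(s + 0.5 * dt * k1)
        k3 = rate(s + 0.5 * dt * k2)
        k4 = rate(s + dt * k3)
        s += dt * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return s


if __name__ == "__main__":
    sigma0, K, a = 1.0, 1.0e-3, 5.0  # arbitrary illustrative values
    for t in (1.0, 10.0, 100.0, 1000.0, 10000.0):
        print(f"t = {t:8.0f}   sigma = {yield_stress_analytic(t, sigma0, K, a):.4f}")
```

Once the annealing time is long enough for the logarithm's argument to be much greater than unity, each tenfold increase in time lowers the yield stress by nearly the same amount, ln(10)/a.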

Recrystallization and grain growth both occur by the migration of a 
grain boundary in a direction normal to itself. The driving force in the 
first process is the difference in energy between the cold worked and 
annealed grains which meet at the boundary; in the second one it is 
the surface tension of the boundary. The velocity with which a boundary 
moves depends, of course, on the driving force, as the theories of MOTT
and TURNBULL show, but evidence has been accumulating recently
that the enormous range in the mobilities of different grain boundaries 
is due, not so much to different driving forces, but to differences in 
orientation. It is convenient here to distinguish four types of boundary : 

(1) The small-angle (e.g. $\theta < 1°$) simple Burgers boundary (Fig. 13),
containing one family of parallel and uncontracted edge dislocations 
(see page 227). 

(2) As (1), but with larger change of angle (e.g. $5° < \theta < 20°$) and
contracted dislocations. 

(3) The small-angle or medium-angle unsymmetrical boundary con- 
taining dislocations with more than one kind of Burgers vector. 

(4) The large-angle (e.g. $\theta \sim 30°$) incoherent boundary across which
there is no continuity along lattice rows, and which may be imagined
in terms of a monomolecular layer of “liquid” between the crystals.

The high mobility of type (1) boundaries has been demonstrated in 
the bend plane experiments of PARKER and WASHBURN, in the bubble
raft experiments of LOMER and NYE, and in the observations of
GUINIER and TENNEVIN, and DUNN and DANIELS, on polygonization.
Type (2) boundaries, on the other hand, do not appear to be very
mobile, as shown by the fact that the growth of polygonal domains
becomes much slower when the angle between them increases.
LOMER and NYE have argued that boundaries made from contracted
dislocations should be immobile because the Peierls-Nabarro force 
(T.D.1, page 90) resisting the motion of a dislocation increases rapidly 
as the width of the dislocation is reduced. 

Boundaries of type (3) are often found in worked polycrystals!**. 144, 135
and there is much evidence for their immobility. TIEDEMA, MAY and
BURGERS, and BECK, SPERRY and HU showed that small-angle
boundaries between strained crystals and adjacent strain-free ones of
almost identical orientation had very low mobilities; the presence of
small unconsumed grains in single crystals grown by the strain-anneal
method is due to this effect. Similarly, the domain structures observed
by KELLAR, HIRSCH and THORP, and by HEIDENREICH, in cold
worked polycrystalline aluminium do not coarsen on prolonged ageing 
even though they form practically instantaneously during working and
are on a very fine scale (~ 2 x 10-*cm). The difficulty of moving 
such boundaries lies in the facts that the glide motions of their disloca- 
tions lie along different directions, and that the climbing motions of these 
dislocations involve diffusion through the regions of “good crystal”
which lie between them.*. © 

The mobility of large-angle boundaries is shown by the well-known 
observation that, in recrystallization and grain growth, the new crystals 
have radically different orientations from those they replace. It seems 
that the development of annealing textures has its origin here. BECK,
SPERRY and HU, for example, showed that the new crystals which
grew fastest into a worked crystal were rotated, relative to its orienta-
tion, by 30 to 40°, about a [111] axis. A large-angle boundary should
move rapidly by the transference of atoms, singly, from one crystal to
the other, since each atom in such a boundary is next to a defect similar
to a vacant atomic site. The movement bears a resemblance to that of
diffusion in a grain boundary and, as LOMER and NYE point out, this is
consistent with the observation of ACHTER and SMOLUCHOWSKI that
grain boundary diffusion becomes much faster when the boundary
angle is increased beyond 20°.

Fig. 28. Increase in the size of a growing crystal
during recrystallization

This dependence of mobility on orientation can be used to explain 
another effect in recrystallization. It is known that recrystallization
starts slowly, and that measurements of the linear growth of a
new grain give a curve of the kind shown in Fig. 28. Over a large range 
the growth increases linearly with time and the rate only falls away 
from this constant value when the new grains impinge on each other. 
However, if the linear part is extrapolated back linearly to zero diameter
it meets the time axis at some positive value $t_0$, and this has led
to the idea that recrystallization is preceded by an earlier stage when
nucleation of new crystals occurs. In the writer’s opinion this extrapolation
is misleading; there does not seem to be any certain evidence
that distinctly different processes occur during “nucleation” and growth.
A reasonable view is that of BURGERS and LOUWERSE, and CAHN,
who suppose that the new grains are formed by the growth of initially 
small pieces of the old ones. Now the boundary dividing a growing 
piece from its surroundings is a small-angle one at first, and so its 
velocity of movement will be smaller than at a later stage of growth, 
where the growing crystal will have advanced into more remote regions 
of the crystal, regions which have radically different orientations from 
its own and which make large-angle boundaries with it. 

On this view, the slow start of the recrystallization process is explained 
by a slowness in the velocity of boundary migration during the early 
stages of growth; thus the linear curve ought not to be extrapolated 
straight back to $t_0$ in Fig. 28, but curved back into the origin, as shown
by the alternative extrapolation. This view also resolves a difficulty in
BURGERS’ and CAHN’s theories, which is to understand why recrystallization
begins in the most intensely curved regions of the crystal, in spite
of the fact that they are the most heavily strained regions and the 
driving force for growth should be least favourable there. We suppose 
that growth starts from them because only there is the orientation so 
widely different from average that a grain growing out of one of them 
can obtain for itself a large-angle boundary; unless it can do this it 
cannot grow at a sufficient speed. This also helps to explain why 
crystals deformed in laminar flow do not recrystallize, on the grounds 
that the orientation in them does not vary widely enough to allow 
large-angle boundaries to be made; similarly it explains why recrystallization
starts so readily from a scratch or similar sharp surface indentation.
As CAHN has already pointed out, the early stages of recrystallization
should be recognizable as polygonization, and TIEDEMA has
recently obtained some direct evidence for this. 


CREEP 

The confirmation of some of the general predictions of the exhaustion 
theory of creep, described in T.D.1, has led to further investigations 
with the result that the theory has been both broadened in scope and 
simplified in treatment. We shall now sketch a version of it based on 
the following assumptions : 

(1) The flowing specimen can be regarded as an aggregate of elements ; 
in many but not all creep problems the elements are dislocation lines. 

(2) Each element has an activation stress $\sigma_a$ (= yield stress of the
element $\sigma_y$ $-$ applied stress $\sigma$) which has to be applied before it flows.

(3) With this activation stress is associated an activation energy
$U(\sigma_a, \sigma)$ which is an increasing function of $\sigma_a$.

(4) When an element is thermally activated by a temporary acquisi-
tion of the energy $U$, it makes a jump, i.e. contributes an increment $v$
to the overall plastic strain.

(5) $U(\sigma_a, \sigma)$ is the same function of $\sigma_a$ for all elements; in particular


we shall make the assumption

$$U(\sigma_a) = A\sigma_a, \qquad (49)$$

which is consistent with the activation energy function of equation (34).
This function is appropriate when the hardening responsible for $\sigma_a$ is
caused by intersecting dislocations; with other kinds of hardening other
functions (e.g. U a o,** for precipitation hardening; see T.D.1) may
be better.

(6) The increment of strain v is the same for all jumps. 

(7) The elements are independent of each other and cannot jump more
than once.

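Let $N(\sigma_a, t)\,d\sigma_a$ be the number of elements with activation stresses
between $\sigma_a$ and $\sigma_a + d\sigma_a$ at time $t$. Then the rate of creep at this time
is given by

$$\dot{\gamma} = v \int_0^{\infty} N(\sigma_a, t) f(\sigma_a)\, d\sigma_a, \qquad (50)$$

where

$$f(\sigma_a) = \nu \exp\{-U(\sigma_a)/kT\} \qquad (51)$$

is the frequency of jumping per element, and $\nu$ is a frequency factor.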


Since each element jumps only once, the numbers of unused elements
are exhausted at a rate

$$\frac{\partial N(\sigma_a, t)}{\partial t} = -N(\sigma_a, t)\, f(\sigma_a), \tag{52}$$

which integrates to give the time dependence of $N(\sigma_a, t)$ as

$$N(\sigma_a, t) = N(\sigma_a, 0) \exp\{-f(\sigma_a)t\}, \tag{53}$$

where $N(\sigma_a, 0)$ is the initial distribution.
With a suitable assumption for $N(\sigma_a, 0)$, e.g.

$$N(\sigma_a, 0) = N \quad (0 < \sigma_a < \infty), \tag{54}$$

and with the energy function of equation (49), the integral in equation
(50) for the creep rate is elementary. However, we shall use another
method which is only approximate but has greater generality and is more
suggestive physically. With reasonable values for the physical parameters
involved, the function $\exp\{-f(\sigma_a)t\}$ is practically a step
function of $\sigma_a$ when $\nu t > 1$. For small values of $\sigma_a$ it is nearly zero, and
for large ones nearly unity. Only in a very small range of $\sigma_a$, centred
about the value for which $f(\sigma_a)t = 1$, does it change from the one
extreme to the other. Thus the exhaustion process can be regarded as
the advance of a sharp step or thermal front along the $\sigma_a$ axis. As the
step sweeps across each part of the $N(\sigma_a, 0)$ curve all the elements in
that part are activated and make their contribution to the creep. In a
time $t$ the step arrives at the activation stress $\sigma_t$ defined by


$$U(\sigma_t) = kT \log \nu t, \tag{55}$$

or, if equation (49) is used,

$$\sigma_t = (kT/A) \log \nu t. \quad (\nu t > 1) \tag{56}$$


The creep strain at time $t$ is then

$$\gamma(t) = v \int_0^{\sigma_t} N(\sigma_a, 0)\, d\sigma_a, \tag{57}$$

or, with $N(\sigma_a, 0) = N$,

$$\gamma(t) = (vNkT/A) \log \nu t. \quad (\nu t > 1) \tag{58}$$
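The quality of the step (thermal front) approximation behind equation (58) can be checked numerically. The following is a minimal sketch, not part of the original treatment: it assumes the linear energy function of equation (49) and a uniform initial distribution, and it introduces a dimensionless variable s = Aσ_a/kT purely for convenience. The function names are hypothetical. It integrates the exhausted fraction 1 − exp{−f(σ_a)t} directly and expresses the strain in units of vNkT/A, so that the step approximation is simply log νt.

```python
import math

EULER_GAMMA = 0.5772156649015329  # Euler's constant, used only for comparison


def exact_reduced_strain(nu_t, ds=1e-3, tail=40.0):
    """Strain in units of v*N*kT/A, without the step approximation.

    Integrates 1 - exp(-nu_t * exp(-s)) over s = A*sigma_a/kT from 0
    upwards with the trapezoidal rule. The upper limit log(nu_t) + tail
    is an assumption: beyond it the integrand is negligibly small.
    """
    s_max = math.log(nu_t) + tail
    n = max(1, int(round(s_max / ds)))
    h = s_max / n

    def exhausted(s):
        # Fraction of elements at reduced activation stress s that have jumped.
        return -math.expm1(-nu_t * math.exp(-s))

    total = 0.5 * (exhausted(0.0) + exhausted(s_max))
    for i in range(1, n):
        total += exhausted(i * h)
    return total * h


def step_reduced_strain(nu_t):
    """Strain in the same units from the thermal-front result, log(nu*t)."""
    return math.log(nu_t)


if __name__ == "__main__":
    for nu_t in (1e2, 1e4, 1e8):
        exact = exact_reduced_strain(nu_t)
        step = step_reduced_strain(nu_t)
        print(f"nu*t = {nu_t:.0e}: exact = {exact:.4f}, step = {step:.4f}, "
              f"difference = {exact - step:.4f}")
```

Under these assumptions the two results differ, for large νt, only by an additive constant close to Euler's constant (about 0.58 in these units), so the logarithmic time dependence is unaffected by the approximation.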


At time $t$ the stress

$$\sigma + \sigma_t \tag{59}$$

is the “yield stress” of the material in the sense that, if the applied
stress could be raised to this value, flow could occur without the aid
of thermal fluctuations. By making sudden small increments of applied
stress during a creep experiment the presence and position of this
thermal front can be examined.

Two investigations have been made recently to check the exhaustion
theory. DAVIS and THOMPSON used a precipitation-hardened copper-silver
alloy and showed that their creep curves obeyed a time law of the
kind $\gamma \propto (\log \nu t)^{2/3}$, which is consistent with the exhaustion theory if
the activation energy function $U \propto \sigma_a^{3/2}$ appropriate to a
precipitation-hardened alloy is used. By means of incremental loading tests
they also demonstrated the existence of a critical stress, i.e. the existence
of the thermal front, which could be reached by a suitable load increment
(e.g. an increment of 1-4 kg mm$^{-2}$ after creep for 24 hours at room
temperature under a constant applied stress of 6 to 8 kg mm$^{-2}$) and at which
“instantaneous” plastic flow was produced. This investigation also
showed that the magnitude of the activation energy expected from the
precipitation-hardening theory (see T.D.1) was of the right order, but
that the observed value ($\approx 1$ sec$^{-1}$) of $\nu$ was far smaller than that
expected (~ 10* sec$^{-1}$) for the vibration of a dislocation against a
precipitate. This discrepancy has still not been satisfactorily explained.
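The link between the form of the activation energy function and the creep law can be made explicit with the thermal-front argument of equations (55) to (58). The sketch below is illustrative only. It assumes a power law U = Cσ_a^n (with n = 1 for intersecting dislocations and, as an assumption taken from the precipitation-hardening case discussed above, n = 3/2) and a uniform initial distribution; the names and the constant C are hypothetical.

```python
def front_stress(log_nu_t, kT, C, n):
    """Activation stress sigma_t reached by the thermal front.

    Solves U(sigma_t) = kT * log(nu*t) for the assumed power law
    U = C * sigma**n.
    """
    return (kT * log_nu_t / C) ** (1.0 / n)


def creep_strain(log_nu_t, kT, C, n, v=1.0, N=1.0):
    """Creep strain v*N*sigma_t for a uniform initial distribution N."""
    return v * N * front_stress(log_nu_t, kT, C, n)


if __name__ == "__main__":
    kT, C = 0.025, 1.0  # arbitrary illustrative units
    for n in (1.0, 1.5):
        ratio = creep_strain(20.0, kT, C, n) / creep_strain(10.0, kT, C, n)
        # Doubling log(nu*t) multiplies the strain by 2**(1/n).
        print(f"n = {n}: strain ratio on doubling log(nu t) = {ratio:.4f}")
```

With n = 1 the strain is proportional to log νt, and with n = 3/2 it is proportional to (log νt)^(2/3), which is the time law quoted for the precipitation-hardened alloy.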

The other investigation was that of WYATT, who used copper
hardened by the working received during the creep process itself. At
low temperatures (i.e. < 300°K) his specimens obeyed the simple
logarithmic creep law of equation (58), and he proved that, under these
conditions, a mechanical equation of state

$$\dot\gamma = f(\sigma, \gamma, T) \tag{60}$$

was obeyed. At constant temperature $T$, for example, this means that
the rate of strain $\dot\gamma$ at any given instant depends only on the current
values of the applied stress $\sigma$ and strain $\gamma$ at this instant, and not on
past values. Essentially the method of proof was to make sudden
increments of applied stress during the creep experiment and to show
that the portions of the creep curve which followed such increments
could always be fitted exactly on earlier portions of the curve by sliding
them backwards by a suitable amount. From the distance along the
$\log t$ axis that it was necessary to slide the curves back, Wyatt
concluded that the activation stress increases linearly with $\log t$, which
implies a linear relation, equation (49), between activation energy and
activation stress. A further analysis of Wyatt's results shows that
the activation energy function in these experiments is of the form

$$U(\sigma_a, \sigma) = B\sigma_a/\sigma, \tag{61}$$

which is almost exactly what would be expected from the theory of
intersecting dislocations; this can be seen by substituting $B = \alpha\mu b^3$,
$(\alpha\mu b/l) - \sigma = \sigma_a$, and approximating $\sigma + \sigma_a \simeq \sigma$ in equation (34).
Moreover the value of $B$ deduced from the experiments is about 4 eV
which is a very reasonable magnitude for $\alpha\mu b^3$. These experiments of
Wyatt therefore lend good support to the idea that work hardening is
caused by intersecting dislocations having to cut through one another.
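The statement that about 4 eV is a reasonable magnitude for αμb³ can be checked by simple arithmetic. The sketch below is only an order-of-magnitude illustration: the shear modulus and Burgers vector used for copper are approximate textbook values supplied here as assumptions, not figures from the text, and α is taken as unity.

```python
EV_IN_JOULES = 1.602176634e-19  # one electron volt in joules

# Assumed approximate values for copper (not taken from the text).
MU_COPPER_PA = 4.5e10   # shear modulus, roughly 45 GPa
B_COPPER_M = 2.56e-10   # Burgers vector, roughly 2.56 angstroms


def intersection_energy_ev(mu_pa, b_m, alpha=1.0):
    """Return alpha * mu * b**3 expressed in electron volts."""
    return alpha * mu_pa * b_m ** 3 / EV_IN_JOULES


if __name__ == "__main__":
    energy = intersection_energy_ev(MU_COPPER_PA, B_COPPER_M)
    print(f"alpha*mu*b^3 for copper (alpha = 1): about {energy:.1f} eV")
```

With these assumed inputs the product comes out between 4 and 5 eV, so a measured B of about 4 eV corresponds to α of order unity.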

At higher temperatures and larger creep strains logarithmic creep
usually gives way to a combination of ANDRADE's $t^{1/3}$ creep and steady-
state creep. Wyatt has shown that a mechanical equation of state is
not obeyed during $t^{1/3}$ creep, so that any change made in the exhaustion
theory to enable it to deal with these other forms of creep would have
to be a profound one. Such a change has been suggested recently,
based on the idea that once an element has jumped, it is not necessarily
unable to jump again, but faces an energy barrier which is uncorrelated
with the one it has jumped and is chosen at random from the whole
range of barriers. This idea is suggested by the model of a dislocation
meeting a random sequence of obstacles during its glide motion through
a slip plane. It leads to a creep function which contains a steady-state
term and has a transient term rather similar to, but not identical with,
the $t^{1/3}$ function. Except at extremely small strains, where the creep
function approximates to the logarithmic type, a mechanical equation
of state is not obeyed.

A formal theory of steady-state creep, the so-called recovery theory,
can be built up on the idea that the creep rate remains constant because
work hardening and thermal softening are balanced against each other
during this kind of flow. Thus OROWAN defines a coefficient
of work hardening, $h = \partial\sigma/\partial\gamma$, and a rate of recovery, $r = -\partial\sigma/\partial t$.
During steady-state creep the yield stress remains constant,

$$d\sigma = \frac{\partial\sigma}{\partial\gamma}\,d\gamma + \frac{\partial\sigma}{\partial t}\,dt = 0,$$

so that the rate of steady flow is given by

$$\kappa = \dot\gamma = r/h. \tag{62}$$


A consequence of the theory is that if the applied stress is suddenly
reduced by a small amount, during steady flow, there should be a pause
in the flow before the new steady rate is taken up, since the yield stress
has first to fall by recovery by the amount that the applied stress was
reduced. The existence of these induction periods has now been
confirmed. COTTRELL and AYTEKIN, for example, showed with
zinc crystals that the observed time of induction can be accounted for
approximately, with the aid of equation (62), in terms of the reduction
in stress, the rate of steady flow, and the coefficient of work hardening.
They also observed that the dependence of $\kappa$ on applied stress and
temperature could be represented by the relation

$$\kappa = C \exp\{-(Q - \beta\sigma)/kT\}, \tag{63}$$

where the parameters $Q$ and $\beta$, when measured in a standard range of
$\sigma$, had the same values as in recovery experiments on the same material.
Since equation (62) allows us to write $-(1/h)(\partial\sigma/\partial t)$ for $\kappa$, equation (63)
is essentially the differential equation for recovery which, as mentioned
on page 253, integrates to give the logarithmic time law of softening.
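Both consequences of the recovery theory mentioned above can be written out in a few lines. The sketch below is an illustration under stated assumptions rather than the authors' own calculation. For the induction period it assumes that recovery proceeds at the steady-state rate r = hκ throughout the pause. For the softening law it integrates ∂σ/∂t = −hC exp{−(Q − βσ)/kT} at constant h, which gives a logarithmic decrease of σ with time. All function names and numerical values are hypothetical.

```python
import math


def induction_time(delta_sigma, kappa, h):
    """Pause after a small stress reduction delta_sigma.

    Assumes the yield stress falls at the steady-state recovery rate
    r = h * kappa, as implied by equation (62).
    """
    return delta_sigma / (h * kappa)


def softened_stress(t, sigma0, h, C, Q, beta, kT):
    """Yield stress after recovering for a time t from sigma0.

    Closed-form integral of d(sigma)/dt = -h*C*exp(-(Q - beta*sigma)/kT),
    namely sigma0 - (kT/beta) * log(1 + t/t0).
    """
    a = beta / kT
    t0 = math.exp((Q - beta * sigma0) / kT) / (a * h * C)
    return sigma0 - math.log1p(t / t0) / a


if __name__ == "__main__":
    params = dict(sigma0=1.0, h=2.0, C=3.0, Q=5.0, beta=1.5, kT=0.5)
    for t in (0.0, 10.0, 100.0, 1000.0):
        print(f"t = {t:7.1f}: sigma = {softened_stress(t, **params):.4f}")
    print("induction time:", induction_time(0.1, 0.5, 2.0))
```

For times long compared with the constant t0 the stress falls linearly with log t, which is the logarithmic time law of softening referred to in the text.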

The explanation of the final stage of the creep curve, in which the 
rate of flow increases progressively to fracture, remains an unsolved 
problem. Various suggestions have been put forward invoking effects 
due, for example, to changes in cross-section, opening of cracks in 
heavily worked slip planes and near grain boundaries, recrystallization, 
and metallurgical phase changes; accounts of these have been given in 
recent books. However, there is no chance of making a detailed
quantitative, or even qualitative, theory of the late stages of creep until
we have a more intimate picture of the various structural changes which
occur during a creep experiment, which accounts for the great contemporary
interest in metallographic studies of creep.

Two features which have claimed most attention are the cell- 
structures formed within the grains, and the sliding of grains past one 
another along their boundaries. The nature of cell-structures and their 
relation to polygonization have already been discussed on page 252. 
Slip along grain boundaries has been studied directly by observing the 
sliding of one crystal over another in a bicrystal, and indirectly by
means of internal friction measurements. The advantage of this
second method is that the movements involved are very small so that
extraneous effects, due to interlocking of crystals at places where the
boundaries change their direction, are reduced. On the other hand,
because it is an indirect method of study, the interpretation of its results
is less certain. Theories have been developed, for example, in which the
grain boundaries are considered to act either as surfaces along which
slip occurs, rather as if they consisted of monatomic layers of
liquid, or as sources and sinks of vacancies, in which case creep is
caused by the migration of these vacancies through the grains from one
boundary to another.

Direct observation of displacements at grain boundaries and of
plastic deformation and cracks in grains at the ends of such
boundaries leaves no doubt that grain boundary slip is a most important
feature of creep in polycrystals at high temperatures. These boundary 
movements cannot occur without some sympathetic plastic deformation 
at the corners where grains interlock and the final rate of flow must 
depend on this deformation within the grains, even although it is 
started by slip along the boundaries. The nature of the interaction of 
slip on the boundaries with flow in the grains is the problem which 
must be solved if the accelerating stage of creep is to be properly under- 
stood, or if the theory of creep is to be of practical value in the design 
of creep-resistant alloys. Here, as is usual in these more complicated 
problems of crystal plasticity, one looks to metallographic examination 
to provide the vital clues to inspire the theory, and there are signs from 
the recent work that they will not be long forthcoming.


DIFFUSION IN METALS


A.D. Le Claire 


THE last few years have been witness to a widening interest in the
phenomenon of diffusion in metals and it is the purpose of the present
chapter to summarize the more important advances that have been
made, particularly since the time of writing of Volume I of this series,
which will be referred to hereafter as PMP.1.

Of greatest importance to our general understanding of chemical 
diffusion in polycrystalline materials, the case of most practical 
interest, has been the work on the Kirkendall Effect and on grain 
boundary diffusion processes and the first two sections are devoted to 
these topics. A knowledge of the fundamental atomic process by which 
diffusion occurs is essential for a complete understanding of any diffu- 
sion phenomenon and the discussion of this topic forms the subject of 
the next section. This includes an account of the first appreciable 
successes which have been achieved in relating experimentally deter- 
mined diffusion coefficients to theoretical expressions for them. A few 
new and interesting methods for measuring diffusion coefficients are 
described in Section IV and in the last section an account is given of 
recent developments in the phenomenological and kinetic treatments of 
chemical diffusion. 

Perhaps the most important result of the last few years' work has
been to provide a clearer understanding of the nature of the problems 
involved in studying diffusion in metals and so to point the way to the 
type of experiments which are most likely to further progress. 


SECTION I


THE KIRKENDALL EFFECT 


The original experiments of SMIGELSKAS and KIRKENDALL, revealing
what has now come to be known as the “Kirkendall Effect,” were
performed in 1946 and reviewed in PMP.1. Since then a considerable
amount of experimental work has been devoted to confirming their
results, to examining them in more detail and to demonstrating the
occurrence of the Kirkendall effect in an extensive range of systems.
The importance of the Kirkendall effect lies in the assistance it gives 
to elucidating the precise atomic mechanism of diffusion, in its demon- 
strating the inadequacy of a single diffusion coefficient for fully 
describing the diffusion behaviour of a binary alloy and, when studied in
more detail, in revealing the complexity of the process which experiments 
on chemical diffusion seek to study. 

The first of these points is dealt with in Section III. In this section 
we shall review the experimental evidence on which the second two 
points are made and include a reference to the more refined phenomeno- 
logical description of binary diffusion which the occurrence of a 
Kirkendall effect demands. 

As was described in more detail in PMP.1, SMIGELSKAS and KIRKENDALL
laid a parallel set of fine molybdenum wires along each of a pair of
opposite faces of a rectangular bar of 30 per cent α-brass and then
plated it overall with a thick layer of copper. After annealing to produce
diffusion the opposite sets of molybdenum wire markers were found to
have moved in towards one another by an amount proportional to the
square root of the annealing time and greater than could be accounted
for by density changes due to changes in composition. This shift of
markers placed at the original interface between two interdiffusing
metals or alloys, relative to parts of the sample where no diffusion
occurs, is what is known as the “Kirkendall Effect.” It has now been
confirmed for various compositions and temperatures in copper/α-brass
couples by CORREA DA SILVA and MEHL, BÜCKLE and BLIN, and
BARNES and shown by these investigators and by SEITH and KOTTMANN
to exist also in many other single-phase interdiffusing
systems such as Cu/Ni, Cu/Au, Ag/Au, Ag/Pd, Ni/Co, Ni/Au, and Fe/Ni
and in the α-ranges of Sn/Cu and Al/Cu.

A variation in technique employed in many of these experiments was
the use of “sandwich samples” prepared by welding to the opposite
faces of a plate or disc of one of the metals or alloys, on which the
markers were placed, plates or discs of the other metal or alloy. In all
cases where measurements were made to show it, the extent of the
marker shift was again found proportional to the square root of the
annealing time, strongly suggesting it to be a true diffusion phenomenon;
it also increases with temperature at fixed annealing time, and
is larger the greater the initial difference in composition between the
two halves of the diffusion sample. In all cases the movement is
towards that side of the sample with the lower melting point, that is,
towards copper in copper-nickel samples but towards gold in copper-gold
samples. The observed shifts are of order 1/10 mm for the times and
temperatures normally employed in diffusion experiments.

Fig. 1. Kirkendall Effect Experiments

CORREA DA SILVA and MEHL measured the Kirkendall effect in a
large number of sandwich samples both by the direct SMIGELSKAS and
KIRKENDALL method and by new methods. These included, for
example, measuring relative to the centre of a foil marker, the shift
along the diffusion direction of fine wire markers inserted at the same
interface (“foil reference method”) or of the edge of the foil itself,
which was found to keep pace with the wire markers (“foil bend
method”). See Fig. 1. No diffusion occurs across the centre of the foil
which, therefore, always defines the original position of the interface.
Such methods avoid measuring small changes in distance between well
separated sets of markers as in sandwich samples, which might be
affected by secondary processes unconnected with diffusion, but at the
same time might be expected to interfere much more than wire markers
with the unidirectional diffusion flow. They demonstrated that whatever
method of measurement was used and whatever type of marker
was employed, be it alumina powder, carbonized silk thread or any one of
a number of different metal wires or foils, the extent of the marker shift
for any one system was always roughly the same for a given temperature
and time of anneal; this was true, most strikingly, even for platinum
wires which dissolved to produce as “markers” regions within the lattice
itself of high platinum concentration. This confirms that the markers
play only a passive role in the process, serving merely to reveal a
movement relative to one another of different parts of the lattice: to
be explicit we may assume they are fixed relative to the lattice. Using
the copper/α-brass case as an example, an immediate inference from the
Kirkendall effect then is that relative to the lattice the number of
zinc atoms flowing from the brass towards the copper must be greater
than the number of copper atoms flowing in the opposite direction to
take their place. The copper side of the interface must expand, new
lattice planes being created, to accommodate the increased number of
atoms and if a non-defective lattice in which all sites are occupied is to
be maintained the brass, which suffers a nett loss of atoms, must at the
same time shrink. Thus is produced a motion towards regions richer in
the more rapidly diffusing zinc of all lattice planes in the diffusion zone,
and in particular of wire markers at the original interface, as observed.

Several processes have been suggested which may lead to the
unequal opposite flows of the two species of atoms which the Kirkendall
effect implies. It now seems unlikely that a preferential transport of one
component through the vapour phase is in general important, as was
proposed for the zinc in α-brass-copper couples, if only because a
Kirkendall effect is found in such systems as the copper-aluminium
one where the vapour pressures are roughly equal and about 10°
times less than that of zinc. In addition, successive investigations by
different workers on 30 per cent α-brass-copper couples have,
through changes in technique, effected a removal or variation of special
facilities for possible vapour transport such as porosity in electroformed
deposits and imperfections at the initial interface, without leading to
any appreciable change in the magnitude of the Kirkendall effect. This
insensitivity of results to a most probable variation also of grain structure
provides evidence against a second possibility that the higher
rate of flow of one component arises principally from its faster diffusion
at grain boundaries. However, slight bulges in the moving interface in
the direction of an increased Kirkendall effect have been observed in
nickel-copper samples near those nickel grain boundaries where suitable
etching has revealed a preferential diffusion of copper, but these
only represent a very small addition to the shift of interface apparent in
regions well away from grain boundaries. Furthermore, Kirkendall
effects have been observed in samples the grain size of which was
purposely made large (3-15 mm). It appears then that we must assume
the different rates of transfer to be associated principally with differences
in the diffusion rates of the two atoms in the lattice itself.

It is only at this stage that we are able to reject the possibility of a 
ring diffusion mechanism for those alloy systems in which a Kirkendall 
effect has been found (see Section III). 

The Kirkendall effect experiments, however, do not constitute the 
first or only evidence available to show that relative to the crystal 
lattice the two species of a binary alloy diffuse at unequal rates. As 
early as 1933 SEITH and KEIL showed that the diffusion rates, at low
concentration, of several elements in lead were substantially greater
than the self diffusion rates of lead in the same alloys. JOHNSON, also,
in 1942 demonstrated by direct measurement the inequality of the 
self diffusion rates of silver and of gold in a 50 per cent alloy of silver 
and gold. 

BÜCKLE and BLIN have suggested that the two species diffuse by
two different mechanisms, the zinc, say, by an interstitial mechanism
and the copper by a vacancy mechanism, so that unequal diffusion
rates are to be expected. In the absence of any other evidence for
interstitial solid solution in either α-brass or in any other system showing
a Kirkendall effect, and in view further of HUNTINGTON and SEITZ's
calculations (see Section III), this is not a very acceptable suggestion.
SEITZ first clearly pointed out that diffusion of both species by the
vacancy mechanism is perfectly adequate to describe the greater
transfer of, say, zinc than copper, for it is only necessary to assume that
vacancies exchange with zinc atoms more readily than with copper.


The Phenomenological Description of Diffusion with a Kirkendall Effect 
and Further Phenomena Associated with Chemical Diffusion 


Whatever the mechanism of diffusion, the demonstration of unequal
rates of flow for the two atomic species led DARKEN to propose that two
separate diffusion coefficients $D_A$ and $D_B$ should be defined, one for each
species, specifying their respective rates of flow relative to the markers,
or to planes of the lattice. With certain simplifying assumptions he
then showed, as described fully in PMP.1, that on account of the
difference in rates of flow, planes of the lattice, or markers, at points in
the diffusion zone where the concentration gradient is $\partial N_A/\partial x$ will
move relative to points where there is no concentration gradient and
so well removed from the diffusion zone with a velocity $v$ given by

$$v = (D_A - D_B)\,\partial N_A/\partial x. \tag{1}$$

Also, the single chemical interdiffusion coefficient $D$ as normally
measured experimentally, and referred strictly to the MATANO interface,
is given in terms of the separate or “partial chemical diffusion
coefficients” $D_A$ and $D_B$ by

$$D = N_B D_A + N_A D_B, \tag{2}$$

where $N_A$ and $N_B$ are the fractional concentrations of the two species
at the composition to which $D$, $D_A$ and $D_B$ refer. Since $D_A \neq D_B$, $D$
necessarily varies with composition, although such variation is due also
to variation of $D_A$ and $D_B$ since $D$ does not appear to vary linearly with
composition.

$D$ measures the rates of flow relative to a surface defined such that
equal numbers of atoms of each species diffuse in opposite directions
across it. This has the merit of simplicity in that such rates of flow are
equal for the two species, and in many practical cases $D$ is an adequate
measure of the diffusion behaviour of an alloy. But a fuller description
is obtained in terms of $D_A$ and $D_B$ and a better understanding of diffusion
in binary alloys would probably be obtained through study of these
theoretically simpler and more fundamental coefficients than through
study of $D$, as has so often been attempted in the past.
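The way in which the two partial coefficients follow from a simultaneous measurement of the interdiffusion coefficient and the marker velocity can be sketched as below. This is an illustration only: it assumes equations (1) and (2) in Darken's usual form, v = (D_A − D_B) ∂N_A/∂x and D = N_B D_A + N_A D_B with N_A + N_B = 1, and the function name and numerical values are hypothetical.

```python
def partial_coefficients(D, v, dNA_dx, N_A):
    """Solve equations (1) and (2) for the partial coefficients D_A, D_B.

    D       interdiffusion coefficient at the marker composition
    v       marker velocity relative to the undiffused ends
    dNA_dx  gradient of the atomic fraction of A at the markers
    N_A     atomic fraction of A at the markers
    """
    if dNA_dx == 0:
        raise ValueError("no concentration gradient at the markers")
    N_B = 1.0 - N_A
    difference = v / dNA_dx          # D_A - D_B, from equation (1)
    D_A = D + N_A * difference       # substitute into equation (2)
    D_B = D - N_B * difference
    return D_A, D_B


if __name__ == "__main__":
    # Invented values (cm^2/sec, cm/sec, cm^-1), for illustration only.
    D_A, D_B = partial_coefficients(D=2.4e-9, v=-1.0e-7, dNA_dx=-50.0, N_A=0.3)
    print(f"D_A = {D_A:.3e}, D_B = {D_B:.3e}")
```

Because the marker velocity enters only through the ratio v/(∂N_A/∂x), errors in the measured gradient at the markers pass directly into both partial coefficients.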




In sufficiently dilute solution of, say, A in B, $D_A = D$. But in
general $D_A$ and $D_B$ can only be calculated, using equations (1) and (2),
from simultaneous measurement of $D$ and $v$. It is very often found that
the marker shift $x$ is proportional to $t^{1/2}$, $t$ being the time of anneal. In
such cases $v$ can be obtained from a single measurement of $x$ through
the equation $v = x/2t$. However, LE CLAIRE has pointed out that there
is no obvious reason why in general $x$ should be proportional to $t^{1/2}$,
although such a relation holds when there is no concentration change
at the markers, a condition approximately fulfilled for markers placed,
as they usually have been, at the original interface. SEITH and
KOTTMANN have recently studied the movement of markers placed
in positions in the diffusion zone other than at the original interface
between the two dissimilar metals, and it can be seen from their
results that, for some markers, approximately doubling the time of
anneal more than doubles even the corresponding values of $x$. The
relation $x \propto t^{1/2}$ is then not in fact generally true and $v$ is best
determined from a series of measurements of $x$ and $t$.
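The difference between the two ways of obtaining the marker velocity can be illustrated as follows. If x = k t^(1/2) then v = dx/dt = x/2t, so one measurement suffices; otherwise the slope must be taken from a series of measurements. The sketch below uses a simple central-difference estimate of the slope, which is one possible choice and not a method given in the text, and the data in the example are synthetic.

```python
import math


def velocity_parabolic(x, t):
    """Marker velocity from one measurement, valid only if x ~ t**0.5."""
    return x / (2.0 * t)


def velocity_from_series(times, shifts):
    """Marker velocity dx/dt at interior points of a measured series.

    Uses central differences; returns a list of (time, velocity) pairs.
    """
    if len(times) != len(shifts) or len(times) < 3:
        raise ValueError("need at least three matching (t, x) measurements")
    result = []
    for i in range(1, len(times) - 1):
        slope = (shifts[i + 1] - shifts[i - 1]) / (times[i + 1] - times[i - 1])
        result.append((times[i], slope))
    return result


if __name__ == "__main__":
    times = [float(t) for t in range(1, 10)]            # synthetic anneal times
    parabolic = [0.03 * math.sqrt(t) for t in times]    # x ~ t**0.5
    faster = [0.01 * t ** 0.8 for t in times]           # x grows faster than t**0.5
    for label, shifts in (("parabolic", parabolic), ("faster", faster)):
        t, v = velocity_from_series(times, shifts)[3]
        x = shifts[times.index(t)]
        print(f"{label}: series v = {v:.5f}, x/2t = {velocity_parabolic(x, t):.5f}")
```

For the parabolic series the two estimates agree closely, whereas for a shift that grows faster than t^(1/2) the single-measurement formula underestimates the velocity.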

But such determinations of $D_A$ and $D_B$ are based on the validity of the
assumptions made in deriving the above equations. DARKEN
assumed that changes in lattice parameter on diffusion could be
ignored, that a non-defective lattice was fully maintained by complete
shrinkage and that both expansion and shrinkage took place only along
the direction of diffusion so maintaining constant the cross section
through which diffusion occurs. The first assumption, never strictly
valid of course except in self diffusion measurements, may be avoided
simply by measuring distances in diffusion samples not in terms of
centimetres but in numbers of lattice spacings. This practice has been
adopted in recent years by many workers. The validity of the other
assumptions is a matter for experiment.

A slight porosity in the brass side of their copper/α-brass samples
was reported by SMIGELSKAS and KIRKENDALL and has been confirmed
by later workers. A similar porosity, consisting of large numbers of
tiny holes on that side of the interface which the direction of the marker
shift indicates is suffering a nett loss of atoms, has been found also in
many other systems, such as Fe/Ni, Ag/Au, Cu/Ni, Cu/Al,
Ag/Pd, and Ni/Au. The amount of porosity varies considerably
from system to system and sometimes may only be revealed by
very careful mechanical polishing. It generally increases with increasing
amount of diffusion and is greater the larger the initial difference in
concentration. The density of holes is often not uniform throughout
the side of the sample losing atoms but shows a well defined maximum in
one region, the position of which has been observed in at least one case
to move as diffusion proceeds so as to occur always at a particular con- 
centration in the diffusion zone, its distance from the interface varying 
parabolically with the time of anneal. The pores may in some cases
occupy, at the position of maximum density, a considerable proportion
of the cross section of the diffusion zone in that region, and may join up
in places to form larger cavities and fissures. SEITH and KOTTMANN
for example found this proportion to be as much as 30-40 per cent for
a pure gold/pure silver sample annealed for 98 hours at 900°C. Occasionally
the holes appear to be octahedral in shape with faces parallel to
{111} planes. For this reason, for the fact that they are produced as
readily in samples made from exclusively vacuum cast material and
because they are uninfluenced by variations in the atmosphere in which
samples are annealed, the porosity does not appear to be caused in any
way by dissolved gases or by secondary gas reactions. From its location
in the diffusion zone then the existence of porosity strongly suggests
that shrinkage is incomplete. A sensitive test for complete shrinkage is
to compare the actual marker shift it produces with that to be expected
from the total nett transfer of atoms across the marker interface, as
determined, for example, by subsequent chemical analysis. The two are
only equal when complete shrinkage occurs; otherwise the observed will
be less than the calculated shift. SMIGELSKAS and KIRKENDALL found
that this latter was the case in their experiments and attributed the
difference to their observed porosity. Now in a sandwich sample of brass
between copper the inward marker shift is due to the shrinkage of 
brass, but in a sandwich of copper between brass the outward marker 
movement is due to the expansion of the copper. BÜCKLE and BLIN
compared the shifts in these two types of samples and showed that the
expansion of the copper, which was quite regular from sample to
sample, was greater than the shrinkage in the brass, which showed more
variation between successive samples, by an amount which corresponded
quantitatively with the measured total volume of the holes that they
found in the brass. Similar results were obtained by SEITH and
KOTTMANN who, by placing additional markers outside the diffusion
zones at the ends of sandwich samples, were able to compare for each
sample the two measurements of the Kirkendall shift made relative
to points, outside the diffusion zone, on each side of an interface. The
end markers in such samples are also generally found to have
moved apart after diffusion revealing an overall increase in length of
the samples. This is of course to be expected if a porosity or incomplete
shrinkage occurs, as also is a decrease in the overall density. Such
changes of length and density are most apparent when magnified by
using as diffusion samples a stack of strips or foils of the two metals
packed alternately. In this way BARNES showed the change in length
to be proportional to the number of diffusion interfaces and to the 
square root of the annealing time and the change in density to cor- 
respond roughly to the total volume of holes. 
Fig. 2. Lateral Dimension Changes in Cu-Ni Diffusion Sample (Ref. 6).
Labels in figure: welded interface; porosity.

The available evidence, both quantitative and qualitative, is then
that this porosity is intimately connected with the diffusion process,
and in view of its location in the diffusion zone, is indicative of an
incomplete shrinkage of that part of the diffusion zone suffering a nett
loss of atoms. As a typical example, BÜCKLE and BLIN found the
marker shift referred to points outside the diffusion zone on the brass
side of copper/α-brass samples, to be between 1/2 and 2/3 of the more
consistent values measured relative to points on the copper side. However,
CORREA DA SILVA and MEHL considered that porosity was
negligible and shrinkage effectively complete, for in the many samples
they examined they found the Matano interface, as computed graphically
from plots of concentration as a function of distance expressed in lattice
spacings, to coincide with the original position, relative to the ends of
the sample, of the initial interface between the two halves of the sample.
While this is a perfectly valid test for complete shrinkage, it is, as
BALLUFFI and ALEXANDER point out, a very insensitive one, for the
graphically computed Matano interface varies only slightly in position for
considerable changes in porosity.
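The Matano interface referred to here is the plane about which the amounts of material gained and lost balance, that is, the plane x_M satisfying ∫(x − x_M) dc = 0 taken over the whole concentration range. A numerical counterpart of the graphical construction can be sketched as below. This is an illustration only: the function name is hypothetical, the test profile is a synthetic error-function curve rather than measured data, and distances may be taken in lattice spacings as described in the text.

```python
import math


def matano_plane(xs, cs):
    """Position of the Matano interface for a measured profile c(x).

    xs  distances along the diffusion direction, in increasing order
    cs  concentrations at those distances; the profile is assumed to
        have reached its terminal values at both ends

    Returns the concentration-weighted mean position, i.e. the x_M for
    which the integral of (x - x_M) dc over the profile vanishes.
    """
    total_change = cs[0] - cs[-1]
    if total_change == 0:
        raise ValueError("profile shows no overall concentration change")
    moment = 0.0
    for i in range(len(xs) - 1):
        x_mid = 0.5 * (xs[i] + xs[i + 1])
        moment += x_mid * (cs[i] - cs[i + 1])
    return moment / total_change


if __name__ == "__main__":
    # Synthetic profile: error-function step centred at x0, for illustration.
    x0, width = 0.7, 1.5
    xs = [-10.0 + 0.01 * i for i in range(2001)]
    cs = [0.5 - 0.5 * math.erf((x - x0) / width) for x in xs]
    print(f"Matano plane at x = {matano_plane(xs, cs):.4f} (profile centred at {x0})")
```

Comparing the computed plane with the original marker position, both referred to the ends of the sample, is the test for complete shrinkage described above.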

CoRREA DA Srtva and Mesw* measured the lateral shift of foil 
markers placed parallel to the direction of diffusion (Fig. 1) in a copper 
a-brass specimen and concluded that since it was negligible all dimen- 
sional changes were in fact unidirectional as assumed by DarKkeEn.* 
But even the shift to be expected from lattice parameter changes at the 
interface, which would be in the same direction as that due to a lateral 
shrinkage, was not observed, and it may well be that an expansion 
produced by porosity compensated for this expected contraction. How- 
ever, examination of the free surface of a specimen, where there is less 
constraint upon the tendency to expand or shrink, imposed by surround- 
ing material, has provided evidence that such dimensional changes may 
take place normal to the direction of diffusion. Fig. 2 shows the
situation observed in nickel/copper couples by SEITH and KOTTMANN, and
similar effects have been observed by these authors in all the systems
they studied. The Kirkendall shift is towards the copper in this system 
and an apparent expansion and shrinkage leading to an increase and 
decrease in the diameter of the sample on the sides of the interface 
suffering respectively a nett gain and a nett loss of atoms, is clearly 
evident. Similar effects have been noticed before by JOHNSON and by
FRAENKEL and HOUBEN in gold/silver diffusion samples. JOHNSON
attributed the whole of the effect he found to evaporation but SEITH and
KOTTMANN prevented free evaporation by packing their samples in
alumina. In addition they observed in silver/palladium samples an
overall decrease in length which they attribute to the nett lateral swell- 
ing of samples and which presumably is greater than the increase in 
length the porosity produced. This could not arise from vapour trans- 
port. That there is a tendency operating across the whole of the sample 
to shrink on one side of the interface and expand on the other is evident 
from the marked bending, observed by BARNES, in nickel/copper
samples that were very thin in the direction of diffusion. The curvature, 
uniform across the sample, was such that copper was on the concave 
side. Thermal expansion effects would have produced the opposite 
curvature in this experiment. SEITH and KOTTMANN estimate that
for one of their silver-gold samples (98 hr at 900°C) only two-thirds of 
the expansion on the gold side of the interface occurred along the direc- 
tion of diffusion, the rest occurring laterally to produce an increase in 
diameter in that part of the diffusion zone. 

These results show that the assumptions underlying equations (1) and 
(2) may not always be even approximately satisfied experimentally. The 
occurrence both of porosity and of lateral dimension changes means 
that the cross section through which diffusion is occurring is not 
constant in time or uniform throughout the diffusion zone and may be 
very appreciably reduced in the region of maximum porosity. This
may seriously impair the accuracy of measurement of D. SEITH and
KOTTMANN determined from their Ag/Au and Fe/Ni samples, using
the Matano method, the variation of D with composition for these 
systems and in each case the results showed what is most probably a 
false minimum for D near those compositions at which maximum 
porosity occurs. 

The same effects will also in general influence the value of v. The 
best value of v for use in equation (1) is obtained by measuring the 
marker shift relative to points outside the diffusion zone, on that side of 
the interface richer in the slower diffusing component: i.e. by using 
samples in which the shift measured is produced by the expansion of 
that part of the diffusion zone receiving a nett addition of atoms. The 
direct effect of porosity in producing a lower value of the shift when 
this is measured relative to points on the other side of the diffusion 
zone is thereby avoided. But there remains the indirect effect of 
porosity, through its influence on the general course of diffusion, and 
the direct effect of the expansion being to some extent lateral to the 
direction of diffusion. 

However, it has usually been experiments in which two dissimilar 
pure metals are diffusing into one another that have revealed marked 
porosity and lateral dimension changes. When the initial difference in 
concentration is smaller, as for diffusion between one metal and an 
alloy of it with another, these phenomena are less marked and their
effect on measurements of D and v may be well within the normal
experimental errors. But no attention has been given in the past to their
possible influence and many experimental results may be suspect for
this reason, especially those from samples covering a large concentration
range.


The Vacancy Mechanism of Diffusion and the Kirkendall Effect 


We now return to the vacancy mechanism description of diffusion in a 
binary system, using the copper/α-brass case for convenience in
discussion. If the vacancies exchange with zinc atoms more readily than with
copper atoms then the flow of zinc atoms towards the copper, arising 
from a concentration gradient of zinc atoms, will be greater than the 
flow of copper atoms towards the brass, as the Kirkendall effect indi- 
cates is actually the case. But since each atom which moves in one 
particular direction causes a vacancy to move in the opposite direction, 
there will be a nett flow of vacancies from the copper towards the brass 
equal to the nett flow of atoms, which is towards the copper. (A 
mathematical expression for the nett vacancy flow in terms of different 
probabilities of exchange of a vacancy with the two species of atoms 
has been given by SEITZ and is dealt with in Section V.) The
interdiffusion of brass and copper is then accompanied by a pumping 
of vacancies from copper to brass, and we might expect the vacancy 
concentration to fall below that corresponding to equilibrium in the 
copper and to exceed it in the brass. But the observed nett transfer of 
atoms towards the copper is much greater than could be effected by 
that number of vacancies which are normally in thermodynamic 
equilibrium with the volume of copper usually employed in the experi- 
ments, so it is clear that some source of vacancies must be operating in 
the copper to produce the large number of extra vacancies required. 
Also, the nett number of atoms lost from the brass to the copper is 
replaced by an equal number of vacancies in the brass and since the 
volume of a vacancy is about the same as that of an atom the vacancies 
must disappear in some way if the brass is to shrink and the markers are 
to move towards the brass side of a diffusion sample. 

Possible sources for vacancies are at the surface of the specimen, or 
at the grain boundaries or dislocations it contains. If the surface acted 
as the predominant source, as once suggested by SEITZ, then we
should expect the Kirkendall effect to depend on the volume of the 
diffusion sample and on whether the outer surface was of copper or of 
brass, but such dependence has never been found experimentally.
In particular the marker shift is always towards the brass and by about 
the same amount in samples with copper sandwiched between brass and 
with brass sandwiched between copper, any difference between the 
two being satisfactorily accounted for as due to porosity. In the
absence of experiments on the Kirkendall effect in single crystals which 
might decide whether or not grain boundaries alone were the predomi- 
nant source, the most attractive and simple sources of vacancies are the 
dislocations. Fig. 3 shows a simple edge type dislocation. If an atom
at S moves down to T a vacancy is left at S and may then diffuse away
by exchange with other atoms. If this is repeated by all S atoms in
planes parallel to the figure, the whole dislocation will have moved
along ZS by one lattice spacing without change of form and one vacancy
will have been produced for each lattice plane. If a dislocation moves
in this way right across from one side of a crystal to another, a new
plane of atoms will have been inserted into the lattice and the two
halves of the crystal will have moved away from one another by one
lattice spacing to accommodate it. Accommodation is therefore provided
for the nett number of atoms diffusing into the copper by the very
process which produces the vacancies necessary for their transfer.
There is clearly a limit to the extent to which any one simple edge
dislocation may act as a source of vacancies, for the completion of the
extra plane of atoms means the removal of that dislocation from the
crystal. It is doubtful whether the density of dislocations, if all of the
simple edge type, would be sufficient to sustain this loss without it
influencing the course of the diffusion. But a dislocation line is not in
general of the simple straight edge type. It may form a closed ring of
one type or another or may meander across the crystal from one side to
the other, assuming an edge character in some regions and a screw
character elsewhere, much as in Fig. 4. Each section of edge character,
such as mn, may act as a source of vacancies in much the same way as
described above but extra atoms are not added to complete a single
plane but so as to build up as a spiral surface continuous with the
spiral structure of, and therefore extending, the screw dislocations
joined at the ends of the edge. For each complete turn of the spiral the
edge is merely moved along the slip direction without change of form.
Atoms may then be added indefinitely, that half of the crystal to the
left of mn moving relatively to the other half in order to accommodate
them, and any number of vacancies produced.

Fig. 3. Simple Edge Dislocation

Fig. 4. General Dislocation Line with Edge and Screw Components

The process is completely analogous to the mechanism of crystal growth 
proposed by FRANK (see Chapter 5 of this volume), the edge dislocation
to which atoms are added corresponding to the ledge to which
they are added on the growing face of a crystal. Other types of 
dislocations which would act as extensive sources of vacancies have 
been described by SEITZ.

When we come to consider mechanisms for removal of vacancies we 
notice at once that the above-mentioned processes have only to be 
reversed for dislocations to act as sinks for vacancies. Thus a vacancy
arriving on any of the sites surrounding the atom S may exchange 
with it and so raise the dislocation line one spacing along SZ. Vacancies 
condensing on the dislocation line in this way ultimately remove a 
whole plane of atoms and cause the two halves of the crystal to move 
towards one another by one atomic spacing. Similarly a general 
dislocation line with edge and screw components may act as an effec- 
tively infinite sink for vacancies merely by reversal of that process by 
which it acts as an infinite source. 

An alternative proposal is that vacancies may aggregate on
particular planes to produce platiform voids which, when they reach a 
critical size, collapse by inducing plastic flow in the surrounding 
material and with a resultant overall shrinkage. Such a collapsed 
platiform void has all the properties and characteristics of a general ring 
dislocation with a component of its Burgers Vector normal to its plane 
("Prismatic Dislocation") which, as SEITZ points out, are sufficient
to explain all those properties of a crystal for which dislocations are 
usually invoked. In fact he suggests that the dislocation content of a 
crystal may arise solely in this way by condensation, during solidifica- 
tion and cooling, of the very large number of vacancies present in the 
molten metal. In particular such rings may act as further sinks (or 
sources) for vacancies in addition to those provided by other types and 
forms of dislocation. 

The presence of a porosity in diffusion samples suggests that vacancies 
may also aggregate in a more isotropic manner to produce roughly 
spherical cavities. These we might expect to be stable when larger than 
some critical size and to grow by further accretion of vacancies into the 
visibly observable voids as first suggested by LE CLAIRE and BARNES.
Produced in this way, it is not surprising that they should have 
crystallographic shape, for vacancies will exchange preferentially with 
atoms whose removal from the surface of the cavity produces faces of 
minimum surface energy. The incomplete shrinkage of that side of a 
diffusion couple suffering a nett loss of atoms, may then be attributed 
to formation of relatively stable aggregates of some of those vacancies 
which have effected this nett loss. Complete shrinkage only occurs when 
the vacancies in excess of those corresponding to thermal equilibrium
are removed completely by condensation on dislocations. 
The nett flux of vacancies at any point is given by


$$ D_A \, dc_A/dx + D_B \, dc_B/dx = (D_A - D_B) \, dc_A/dx $$


and SEITH and KOTTMANN suggest that the maximum porosity
occurs in a region of the diffusion zone where the concentration is such
that this expression is a maximum.
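
As a rough numerical illustration of the vacancy-flux expression above, the sketch below evaluates (D_A - D_B) dc_A/dx across a diffusion couple and locates the point where its magnitude is greatest. It assumes, purely for illustration, constant D_A and D_B and an error-function concentration profile. The function names and all numerical values are hypothetical and are not taken from the experiments discussed here.

```python
import math


def vacancy_flux(x, t, d_a, d_b, d_mix, c_hi=1.0, c_lo=0.0):
    """Evaluate (D_A - D_B) * dc_A/dx at position x and time t.

    Illustrative assumptions only: constant D_A and D_B, and an
    error-function profile with c_A = c_hi far to the left (x < 0)
    and c_A = c_lo far to the right, spread by a single d_mix.
    """
    width = 2.0 * math.sqrt(d_mix * t)
    # Derivative of c_lo + (c_hi - c_lo)/2 * erfc(x / width) with respect to x.
    dc_dx = -(c_hi - c_lo) / (width * math.sqrt(math.pi)) * math.exp(-(x / width) ** 2)
    return (d_a - d_b) * dc_dx


def position_of_peak_flux(xs, t, d_a, d_b, d_mix):
    """Return the grid point where the magnitude of the flux is largest."""
    return max(xs, key=lambda x: abs(vacancy_flux(x, t, d_a, d_b, d_mix)))


# Hypothetical values: species A (zinc-like) moves faster than B (copper-like).
xs = [i * 1.0e-3 for i in range(-50, 51)]  # positions in cm
peak = position_of_peak_flux(xs, t=1.0e5, d_a=3.0e-9, d_b=1.0e-9, d_mix=2.0e-9)
```

With constant coefficients the peak necessarily falls at the original interface. When D_A - D_B varies with composition, as in real systems, the same search can be repeated with a concentration-dependent difference and the peak may shift to one side.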

If the expansion in the copper and the contraction in the brass are to 
be only along the direction of diffusion, the planes of atoms removed or 
added during the operation of a dislocation as sink or source of vacancies 
must be normal to this direction. The dislocation must then have its 
Burgers Vector parallel to the diffusion direction. But it is dislocations 
of just this orientation which will be most employed as sources and 
sinks, for if of other orientation those parts of their dislocation edges 
which are advancing along the diffusion direction would soon run out of 
the diffusion zone, and the capacity of that dislocation as a source or 
sink for vacancies in consequence be reduced. BARDEEN and HERRING
propose this as an explanation of CORREA DA SILVA and MEHL's result
(see p. 272) that there is no appreciable lateral change of dimensions on 
diffusion. While we have seen that this result is not in general true, it is 
true that the expansion and shrinkage take place to a much greater 
extent along the direction of diffusion than would correspond to 
isotropic expansion and shrinkage. A further factor contributing to 
this may be that the surrounding undiffused material will impose 
constraints upon a lateral shrinkage or expansion of the diffusion zone 
i.e. upon the removal or growth of planes of atoms in a direction 
parallel to that of diffusion, but not in directions orthogonal to this. 
When the diffusion zone extends over the whole volume of a sample, so 
that there is no undiffused material, the dimensional changes accom- 
panying diffusion are in fact more nearly isotropic, as recent experiments 
on diffusion into very thin foils and wires have shown.

The growth or removal of planes of atoms in both single crystal and 
polycrystalline material will introduce stresses into the lattice, in 
particular between neighbouring grains in polycrystalline material. 
These may only be relieved by inducing recrystallization and grain 
growth, by introduction of new dislocations and, therefore, of further 
sinks and sources for vacancies, and by processes of plastic deformation 
which themselves contribute to the observed marker shift and lateral 
changes in dimensions. Additional stresses with similar consequence 
will arise from planes of atoms being forced to conform to changes in 
lattice parameter on diffusion. Recrystallization and grain growth in 
the diffusion zone between two metals has often been reported in the 
past and a more recent observation of "polygonization" in the same
region provides experimental evidence for the plastic flow. 

Rough calculations have claimed to show that the normal
dislocation content of a metal is adequate to maintain an equilibrium 
concentration of vacancies throughout the diffusion zone, despite the 
presence of a nett flux of vacancies. For such to be the case the average 
distance travelled by each vacancy during its lifetime must be very 
much less than that over which diffusion occurs, and it is considered 
that the normally assumed density of dislocations should be adequate 
to ensure this, even when by no means every encounter of a vacancy 
with a dislocation leads to its annihilation. However, the occurrence 
of a marked porosity in many diffusing systems would appear to invali- 
date these conclusions and there seems no reason to suppose that a 
vacancy concentration which is everywhere in thermodynamic equili- 
brium with the lattice is in fact maintained in all cases of diffusion. 

The creation and annihilation of vacancies will take place more 
readily at irregularities along an edge dislocation, such as "jogs", so
that the effective number of points where such processes can occur may 
in fact be an even smaller fraction of the number of atoms along 
dislocation edges than these calculations have allowed for. 


Conclusions, relating to the Theoretical Value of Chemical Interdiffusion 
Measurements 


The demonstration of the various phenomena associated with the 
Kirkendall effect makes it difficult to know what theoretical significance 
may be attached to the results of chemical diffusion measurements. 
We have already discussed the difficulties associated with the measurement
of D_A and D_B through use of equations (1) and (2), arising from
porosity and changes in lateral dimensions, but even when these are
sufficiently small not to affect the calculation of D_A and D_B from
measurements of D and v, there would still appear to be several
factors controlling the values obtained which are not normally taken
account of, and which may be very difficult to take account of, in any
theoretical understanding of their magnitudes.

Theoretical treatments usually assume that the rates of chemical 
diffusion in an alloy depend only upon the vacancy concentration and 
upon the different probabilities of exchange of the two atoms with a 
vacancy in the presence of a concentration gradient, these probabilities 
being determined only by the local composition. It is assumed in 
theoretical treatments that the vacancies are in equilibrium with the 
lattice but in the case of chemical diffusion there exists the possibility 
of a vacancy concentration which is not only removed from its equili- 
brium value but also continuously changing during the diffusion. Also, 
it has been suggested that vacancies may aggregate to form double
vacancies in regions of high vacancy concentration and that these 
would be more mobile than single vacancies and so lead to a high 
diffusion rate for the atoms. Abnormally large diffusion rates may then 
occur in that side of a diffusion couple which suffers a nett loss of atoms. 
We may mention that two independent diffusion coefficients D_A and D_B
may only be legitimately defined by Fick’s Law, and so equations (1) 
and (2) are only strictly valid, when the vacancies are everywhere in 
equilibrium with the lattice. Otherwise the rate of flow of, say, A 
atoms depends not only on the concentration gradient of A atoms, but 
also on that of the vacancies. This is discussed more fully in Section V. 
Furthermore, there is the possibility of stresses influencing the rate of 
diffusion and of inhomogeneous plastic flow concurrent with the
diffusion, which latter may generate additional vacancies and produce
distorted lattice regions in which abnormal diffusion rates occur. The 
last two factors are associated with the shrinkage and expansion which 
accompany chemical diffusion and, with the addition of stresses arising 
from the interdiffusion of atoms of unequal size, will tend to produce 
recrystallization and so an enhancement of the grain boundary contribu- 
tion to the overall flow. 

The diffusion rates then appear dependent upon a number of quantities,
many of them changing during diffusion and varying with the
nature of the sample, and all difficult to determine separately. The
relative importance of these several factors in determining the measured 
values of chemical diffusion coefficients and their variation with 
composition and the details of their operation in chemical diffusion, 
remain matters for further experimental study, but their presence may 
well be found to be not unconnected with the often very considerable 
disagreement between the diffusion coefficient measurements made on 
the same system by different observers. Since the Kirkendall effect, 
and therefore phenomena associated with it, is larger the greater the 
difference in concentration between the two halves of a diffusion sample, 
we might expect the most reliable and repeatable chemical diffusion 
coefficient measurements to be those conducted with the smallest
practicable concentration gradients. A study of the dependence of D
upon concentration gradient at fixed concentration might reveal to
what extent these complicating features, associated with a Kirkendall
effect, do, in fact, influence the measured value of D. But the rates of
self diffusion of the two species of atoms in a chemically homogeneous 
alloy depend only upon the vacancy concentration and upon the pro- 
babilities of exchange of the atoms with vacancies. Since chemical 
homogeneity is maintained throughout the diffusion process the vacancy 
concentration remains always at its equilibrium value and none of the 
other complicating factors which may influence chemical diffusion 
rates and their measurement arise at all. Self-diffusion coefficients 
of the atoms of an alloy have then a much simpler and direct significance
and furthermore give directly the most fundamental quantity underly- 
ing all diffusion phenomena, namely the average number of random 
jumps made per second by each atom in a homogeneous alloy in 
thermodynamic equilibrium. Measurements of self-diffusion coefficients 
would probably be far more profitable in furthering a detailed under- 
standing of diffusion in alloys than the already very considerable 
number of chemical diffusion coefficient measurements have been. 


SECTION II


GRAIN BOUNDARY AND SURFACE DIFFUSION


It has always been appreciated that in the neighbourhood of a grain 
boundary and on the surface of a metal the rate at which diffusion takes 
place may differ appreciably from that within a single crystal or grain 
because of the different structure which these regions present. We may 
define coefficients D_l, D_gb, and D_s specifying respectively the rates of
flow by diffusion in the lattice, in grain boundaries, and on the surface 
of a metal and a full description of the diffusion behaviour of any system 
requires a knowledge of all these coefficients and their variation with 
concentration. Such a knowledge is clearly important for a proper 
understanding of the practically important diffusion rates in
polycrystalline materials for in general these will be determined by D_gb and
D_s as well as D_l. Many of the discrepancies between the values of
diffusion coefficients obtained by different workers on the same system 
for both self and chemical diffusion may well be connected with the very 
common failure to take proper account of the possible contribution to 
the observed diffusion flow by grain boundary processes, and which may 
have been more significant in some cases than others. D_s may vary with
the orientation of the surface and D_gb with both the mutual orientation
of the grains adjoining the boundary and with the direction of the grain 
boundary for fixed orientation. Studies of grain boundary and surface 
diffusion may then also provide information on the nature of those 
regions in which they occur. 

Whereas lattice diffusion can by suitable means be studied apart 
from grain boundary and surface diffusion, these processes themselves 
must necessarily be accompanied by diffusion in the lattice of the crystals
or grains with which they are so intimately associated, and so cannot
be studied apart from lattice diffusion. Experimental methods are, 
therefore, directed towards a comparison of the rates of flow by grain 
boundary or surface diffusion with that by lattice diffusion. 


Grain Boundary Diffusion


Two principal methods of investigation have been employed for grain 
boundary diffusion studies. 

I. The comparison of the overall rates of diffusion in materials of 
various grain sizes but similar in all other respects. It has generally 
been assumed in the past but, as we shall see, erroneously, that only
when the measured bulk diffusion coefficient varies with grain size is
D_gb appreciably different from D_l.

II. By examination of individual grain boundaries in sections cut
from diffusion samples parallel to the direction of diffusion the relative 
amounts of diffusion at the grain boundaries and in the neighbouring 
grains may be directly observed. This method usually requires the use 
of a suitable etch which will reveal differences in composition. It is 
clearly an essential condition for full advantage to be taken of this more 
informative method that there should be no changes in the grain 
boundary pattern during the diffusion. If the grain boundaries do 
move as a result of recrystallization or grain growth any difference 
between grain boundary and lattice diffusion rates may be completely 
masked, and even if not completely so the results will anyway be of 
indefinite quantitative significance. 

In the comparatively few studies of grain boundary diffusion that 
have been made, it has been reported that D_gb and D_l do not always
differ but that when they do, they, and the corresponding activation
energies Q_gb and Q_l, where these have been estimated, are such that D_l is
less than D_gb and Q_l greater than Q_gb. This is not surprising in view of
the more open structure presented for diffusion at grain boundaries. 
This early work and its results have been fully reviewed on other 
occasions.*!, 25, 26, 50 
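
A small numerical sketch may help to show what these inequalities imply for the ratio D_gb/D_l at different temperatures. It assumes the usual Arrhenius form D = D0 exp(-Q/RT), which is not written out in this passage, and every pre-exponential factor and activation energy below is a hypothetical value chosen only so that Q_gb is less than Q_l.

```python
import math

GAS_CONSTANT = 8.314  # J / (mol K)


def arrhenius(d0, q, temperature):
    """Diffusion coefficient from an assumed Arrhenius law D = d0 * exp(-q / RT)."""
    return d0 * math.exp(-q / (GAS_CONSTANT * temperature))


def boundary_to_lattice_ratio(temperature, d0_gb, q_gb, d0_l, q_l):
    """Ratio D_gb / D_l at a given absolute temperature."""
    return arrhenius(d0_gb, q_gb, temperature) / arrhenius(d0_l, q_l, temperature)


# Hypothetical parameters (cm^2/sec and J/mol), with Q_gb < Q_l as reported.
ratio_hot = boundary_to_lattice_ratio(1200.0, d0_gb=0.1, q_gb=90.0e3, d0_l=1.0, q_l=190.0e3)
ratio_cold = boundary_to_lattice_ratio(800.0, d0_gb=0.1, q_gb=90.0e3, d0_l=1.0, q_l=190.0e3)
```

Under these assumptions the ratio grows as the temperature falls, because the exponent (Q_l - Q_gb)/RT increases.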

More recently a much needed mathematical analysis of simultaneous 
grain boundary and lattice diffusion has been presented and has
provided a method much less objectionable than hitherto, for the
quantitative measurement of the ratio D_gb/D_l. Quantitative estimates
of D_gb have been made in the past but these have usually been based on
a far less satisfactory analysis, often employing, for example, the
Dushman-Langmuir equation. Application of the analysis has been
made to a few systems and it has also shown that method I will not
necessarily always reveal a difference between D_l and D_gb, even if a very
appreciable difference does in fact exist. This method must then be
applied with caution and many reports for various systems studied by
it, that D_gb and D_l are appreciably the same, may, therefore, be false.
A second important new advance has been a study for the first time of 
the dependence of grain boundary diffusion rates upon the orientations 
with respect to the boundary of the two adjoining grains. 
Methods I and II have both been used to reveal a preferential grain
boundary diffusion in a few more systems in which it had not previously
been observed, although much of this has been in connexion with the
work mentioned in the last paragraph.


Simultaneous Grain Boundary and Lattice Diffusion 


FISHER has considered diffusion into a semi-infinite bi-crystal with its
grain boundary normal to the surface, as shown in section in Fig. 5.
Several assumptions are made to simplify the analysis. (1) The
concentration C_0 of diffusing material at the surface y = 0 is
maintained constant throughout the anneal. (2) The grain boundary is
regarded as a uniform slab of material of width δ in which the diffusion
coefficient D_gb is much greater than that for diffusion in the adjoining
crystals, D_l: the grain boundary region is not, of course, so well
defined a region as this. Material diffusing more rapidly along the grain
boundary than elsewhere will be lost from it laterally into the adjacent
grains so producing an effective penetration of material which is greater
in the neighbourhood of the grain boundary than the uniform penetration
by lattice diffusion alone in regions well removed from the grain
boundary. It is assumed (3) that this lateral loss occurs by diffusion
only in a direction normal to the grain boundary, (4) that D_l and D_gb
are independent of concentration, and (5) that the concentration in the
grain boundary is uniform across its width δ; this is taken into account
in drawing in Fig. 5, a line of constant concentration or a 'concentration
contour'.

Fig. 5. Diffusion in the Neighbourhood of a Grain Boundary

With these assumptions a numerical integration of the appropriate
differential equations, for diffusion in the grain boundary and for
lateral diffusion in the adjoining grains, gave a solution the behaviour
of which had certain features which suggested a simple analytical
expression for C_gb, the concentration of laterally diffused material in
the vicinity of the grain boundary.

$$ C_{gb} = C_0 \exp\left[-\sqrt{2}\, y_1 / (\pi t_1)^{1/4}\right] \operatorname{erfc}\left[x_1 / (2 t_1^{1/2})\right] \qquad (3) $$

where x_1, y_1, and t_1 are dimensionless quantities given by

$$ x_1 = x/\delta, \qquad y_1 = y / [\delta (D_{gb}/D_l)^{1/2}], \qquad t_1 = D_l t/\delta^2 $$


It is to be emphasised that while this represents a useful and practi- 
cable relation for studying grain boundary diffusion, and so far as it has 
yet been tested leads to results which are confirmed by experiment, it
is obtained only by making very considerable simplifications in the 
mathematics and a more detailed and rigorous study of the problem 
would be very welcome. Failure to bear in mind the many assumptions 
upon which equation (3) is based may lead to serious errors in the 
interpretation of experimental results. 
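
For readers who wish to explore equation (3) numerically, a minimal sketch follows. It takes the equation in the form C_gb = C_0 exp[-2^(1/2) y_1/(π t_1)^(1/4)] erfc[x_1/(2 t_1^(1/2))] with x_1 = x/δ, y_1 = y/[δ(D_gb/D_l)^(1/2)] and t_1 = D_l t/δ². The function name is hypothetical, the use of |x| to treat both sides of the boundary alike is an assumption, and the numerical values in the example are hypothetical. The sketch inherits every simplification listed above and is no more reliable than equation (3) itself.

```python
import math


def fisher_concentration(x, y, t, d_l, d_gb, delta, c0=1.0):
    """Approximate concentration near a grain boundary from equation (3).

    x is the lateral distance from the boundary, y the depth below the
    surface, delta the assumed boundary width; consistent units throughout.
    """
    x1 = abs(x) / delta  # assumption: the two sides of the boundary are symmetric
    y1 = y / (delta * math.sqrt(d_gb / d_l))
    t1 = d_l * t / delta ** 2
    decay_along_boundary = math.exp(-math.sqrt(2.0) * y1 / (math.pi * t1) ** 0.25)
    lateral_spread = math.erfc(x1 / (2.0 * math.sqrt(t1)))
    return c0 * decay_along_boundary * lateral_spread


# Hypothetical values in cm, sec and cm^2/sec.
params = dict(t=1.0e5, d_l=1.0e-12, d_gb=1.0e-7, delta=5.0e-8)
at_surface = fisher_concentration(0.0, 0.0, **params)
deeper = fisher_concentration(0.0, 1.0e-3, **params)
off_boundary = fisher_concentration(5.0e-4, 1.0e-3, **params)
```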

Let us now consider diffusion into a polycrystalline sample. For 
simplicity we assume this to have a columnar structure so that all the 
grain boundaries are parallel to the direction of diffusion and intersect 
the free surface normally as does the one in Fig. 5. In any thin section, 
of width Δy cut normal to the direction of diffusion at a distance y from
the free surface y = 0, a certain amount A_l of the material which has
diffused into the section will have been transferred only by lattice
diffusion through the grains and the remainder A_gb by diffusion down
the grain boundaries and by subsequent lateral diffusion from them.
If there are n grains with average linear dimensions R normal to the
columnar or diffusion direction, then A_l is given by


$$ A_l = n R^2 \, \Delta y \, C_0 \operatorname{erfc}\left[y / (2 (D_l t)^{1/2})\right] \qquad (4) $$


where C_0 is the supposedly constant surface concentration of diffusing
material and t is the annealing time. If we assume that the total
amount of material actually in the grain boundaries is negligible compared
with that which has been lost by lateral diffusion, we may write
for A_gb

$$ A_{gb} = 2nR \, \Delta y \int_{-\infty}^{+\infty} C_{gb} \, dx \qquad (5) $$

where C,, is given by equation (3). TurnBULL** has used equations 
(4) and (5) to study the relative importance of transfer by lattice and 
by grain boundary diffusion in diffusion experiments on polycrystalline 
material, and has presented some results, applicable to a typical set of 


experimental conditions, in the form of Table 1. This shows for various 
values of D,,/D, the grain size for which A, and A,, are equal, when 


y = 0.2 cm, t = $10^6$ sec, and $D_l = 10^{-8}$ cm²/sec, conditions such that
the concentration at y of material penetrating by lattice diffusion
is ~ 15 per cent of $C_0$. If we define lattice diffusion as predominant
when $A_l > 10 A_{gb}$, then since $A_l/A_{gb} \propto R$, experiments in which a
measured diffusion coefficient showed no change when the grain size
was varied from 2.6 to 0.014 cm would not be evidence for assuming
that $D_{gb}$ and $D_l$ were appreciably the same, but only that $D_{gb}/D_l < 10^5$.
Thus we may only expect method I to reveal a difference between $D_{gb}$
and $D_l$ when a very considerable range of grain sizes is employed.
Alternatively, since $Q_{gb}$ is usually less than $Q_l$, the ratio $D_{gb}/D_l$
increases with decreasing temperature, so that method I may produce
positive results at sufficiently low temperatures where $D_{gb}$ is so much
greater than $D_l$ that $A_{gb}$ is comparable with $A_l$ for the grain sizes used.

TABLE 1

$D_{gb}/D_l$                                   $10^6$     $10^5$     $10^4$
Grain size for which $A_{gb} = A_l$ (cm)       0.26       0.0014     2.15 × $10^{-?}$
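The conditions quoted for Table 1 can be checked numerically. The short sketch below evaluates the error-function factor of equation (4) and the grain sizes above which lattice diffusion would count as predominant. It assumes the values y = 0.2 cm, t = 10^6 sec and D_l = 10^-8 cm²/sec as read from the text (the exponents are reconstructed), takes the two fully legible grain sizes from Table 1, and applies the criterion A_l > 10 A_gb with A_l/A_gb proportional to R. The function and variable names are illustrative only.

```python
import math

# Assumed conditions (the exponents are reconstructed from the text).
y = 0.2        # depth of the section, cm
t = 1.0e6      # annealing time, sec
D_l = 1.0e-8   # lattice diffusion coefficient, cm^2/sec


def lattice_fraction(y, D_l, t):
    """C/C_0 at depth y for lattice diffusion alone: erfc(y / 2 sqrt(D_l t))."""
    return math.erfc(y / (2.0 * math.sqrt(D_l * t)))


fraction = lattice_fraction(y, D_l, t)

# Grain sizes (cm) at which A_gb = A_l, from Table 1, keyed by D_gb/D_l.
equal_size = {1e6: 0.26, 1e5: 0.0014}

# Since A_l/A_gb is proportional to R, A_l = 10 A_gb at ten times these sizes.
predominant_size = {ratio: 10.0 * R for ratio, R in equal_size.items()}

print(f"C/C_0 at y = {y} cm: {fraction:.3f}")
for ratio, R in predominant_size.items():
    print(f"D_gb/D_l = {ratio:.0e}: lattice diffusion predominant above R = {R:g} cm")
```

With these assumed values the argument of the error function is unity, so that the fraction is close to the 15 per cent quoted, and the two threshold grain sizes are the 2.6 and 0.014 cm used in the argument above.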

When $D_{gb}$ is appreciably greater than $D_l$, material will diffuse much
further into the sample in the neighbourhood of a grain boundary than
elsewhere, and if sections are cut at a greater distance from the surface
y = 0 than the maximum depth of penetration by lattice diffusion
alone through the grains, their content of diffusing material will be
given by equation (5). Performing the integration and taking logs, we
find for $A_{gb}/\Delta y$, the content per unit thickness,


$$\log (A_{gb}/\Delta y) = -\left[\frac{2^{1/2}\,(D_l/D_{gb}\delta)^{1/2}}{(\pi D_l t)^{1/4}}\right] y + \text{constant} \qquad (6)$$


where the constant varies with t but does not contain y.
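The use of equation (6) can be sketched as follows: the slope of a straight line fitted to the logarithm of the content per unit thickness against depth gives the product D_gb δ once D_l and t are known. The sketch assumes that natural logarithms are intended in equation (6); the numerical values are invented purely for illustration and are not experimental data, and the function names are hypothetical.

```python
import math


def gb_product_from_slope(slope, D_l, t):
    """D_gb*delta (cm^3/sec) from the slope d ln(A_gb/dy)/dy of equation (6)."""
    return 2.0 * D_l / (slope ** 2 * math.sqrt(math.pi * D_l * t))


def fit_slope(depths, log_activity):
    """Ordinary least-squares slope of log_activity against depth."""
    n = len(depths)
    dbar = sum(depths) / n
    abar = sum(log_activity) / n
    sxy = sum((d - dbar) * (a - abar) for d, a in zip(depths, log_activity))
    sxx = sum((d - dbar) ** 2 for d in depths)
    return sxy / sxx


# Synthetic illustration with made-up values (not experimental data).
D_l = 1.0e-13            # lattice diffusion coefficient, cm^2/sec
t = 1.0e5                # annealing time, sec
true_product = 5.0e-15   # D_gb * delta, cm^3/sec

# Coefficient of y in equation (6) for these values.
k = math.sqrt(2.0) * math.sqrt(D_l / true_product) / (math.pi * D_l * t) ** 0.25
depths = [0.001 * i for i in range(1, 11)]       # section depths, cm
log_activity = [3.0 - k * d for d in depths]     # ln(A_gb/dy), arbitrary constant

recovered = gb_product_from_slope(fit_slope(depths, log_activity), D_l, t)
print(f"recovered D_gb*delta = {recovered:.3e} cm^3/sec")
```

Only the product D_gb δ is obtained in this way; a separate assumption about δ is needed before D_gb itself can be quoted.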

This relation has been used by HOFFMAN and TURNBULL to determine
for the first time a grain boundary self-diffusion coefficient, that of
silver. A thin layer of radioactive silver was electroplated on to one
face of a number of polycrystalline samples, which were then annealed
at temperatures in the comparatively low range 500-700°C. It was
assumed that surface diffusion and diffusion in the highly disorganized
thin electroplated layer would be sufficiently rapid to maintain the fairly
constant surface concentration necessary for the validity of equation
(3). The log of the specific activity of thin layers cut parallel to the
surface after anneal was found to vary linearly with their distance y
from the surface. Transfer by purely lattice diffusion would have
required a proportionality to $y^2$; that this was not found may be
regarded as evidence in favour of FISHER's equations (3) and (6).
Knowing $D_l$ (see equation (8)) and assuming a value of $5 \times 10^{-8}$ cm for $\delta$,
they found from equation (6) that for high purity silver


$$D_{gb} = 0.03 \exp(-20{,}200/RT) \ \text{cm}^2/\text{sec} \qquad (7)$$


At about 500°C, therefore, $D_{gb}/D_l \sim 10^6$.
To define $D_{gb}$ properly requires the validity of assumption (2) above,
and to calculate it from equation (3) or (6) requires a knowledge of $\delta$,
for the equations only give the product ($D_{gb}\delta$). But it may be more
useful to characterize grain boundary diffusion behaviour in terms of
this measured quantity rather than $D_{gb}$ alone, for it is more significant
in representing, without reference to $\delta$, the amount of material diffusing
in unit time along the grain boundary and across unit length of it
normal to the diffusion direction, the concentration gradient also
being unity.
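The order of magnitude quoted for D_gb/D_l at 500°C follows directly from equations (7) and (8). The sketch below evaluates both expressions and the product D_gb δ, assuming R = 1.987 cal/(mole K) and the boundary width of 5 × 10^-8 cm adopted above; the function names are illustrative.

```python
import math

R_GAS = 1.987  # gas constant, cal/(mole K)


def arrhenius(D0, Q, T_kelvin):
    """D = D0 exp(-Q/RT), with Q in cal/mole and T in kelvin."""
    return D0 * math.exp(-Q / (R_GAS * T_kelvin))


def D_gb(T_kelvin):
    """Grain boundary self-diffusion in silver, equation (7)."""
    return arrhenius(0.03, 20200.0, T_kelvin)


def D_l(T_kelvin):
    """Lattice self-diffusion in silver, equation (8)."""
    return arrhenius(0.895, 45950.0, T_kelvin)


T = 500.0 + 273.15
ratio = D_gb(T) / D_l(T)

delta = 5.0e-8               # assumed boundary width, cm
product = D_gb(T) * delta    # the quantity actually fixed by experiment

print(f"D_gb/D_l at 500 C = {ratio:.2e}")
print(f"D_gb*delta at 500 C = {product:.2e} cm^3/sec")
```

The ratio comes out at a few times 10^5, consistent with the order of magnitude given in the text; a different assumed δ would rescale D_gb but leave the product unchanged.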

Fig. 6. Lattice and Grain Boundary Diffusion in Silver

HOFFMAN and TURNBULL have also measured, by standard methods,
the self-diffusion coefficients of silver in both single crystal and
polycrystalline material, with the results shown in Fig. 6. At high
temperatures single crystal and polycrystalline samples show the same
self-diffusion coefficients and these results, which fit closely the earlier high
temperature measurements of JOHNSON on coarsely polycrystalline
silver, may, therefore, be taken as representing the true lattice diffusion
coefficient. The equation to the full line is


$$D_l = 0.895 \exp(-45{,}950/RT) \ \text{cm}^2/\text{sec} \qquad (8)$$


so that $Q_{gb} \approx \tfrac{1}{2} Q_l$. But at lower temperatures the apparent diffusion
coefficient for polycrystalline material is appreciably greater than that
for single crystals, and the results of experiments between 500 and 650°C
fit more closely the equation

$$D_{app} = 2.3 \times 10^{-5} \exp(-26{,}400/RT) \ \text{cm}^2/\text{sec} \qquad (9)$$

It is clear that at high temperatures and with the grain sizes employed
($\sim 10^{-?}$ cm) $D_{gb}$ is not sufficiently greater than $D_l$ for the contribution
to the overall diffusion flow by grain boundary diffusion to be significant,
but that at lower temperatures, where $D_{gb}/D_l$ is greater, grain boundary
diffusion is contributing appreciably.

These experiments are quoted in full because they illustrate a very 
important point in connexion with all experimental determinations of 
diffusion coefficients in polycrystalline materials which purport to give a 
measure of the lattice diffusion coefficient. It is usually considered that 
the more extensive the temperature range over which such measurements
are made, the more accurate will be the resulting values of $D_0$
and Q (in the equation $D = D_0 \exp(-Q/RT)$) derived from them.
We now see that they may represent lattice diffusion only at high
temperatures but that abnormally high values of D may be obtained
at the lower temperatures due to appreciable grain boundary diffusion.
Experiments are not usually sufficiently accurate or numerous to reveal
a departure at low temperatures from a linear relation between log D
and 1/T, so a ‘best straight line’ drawn uncritically through both high
and low temperature results, when plotted as log D against 1/T, may
then produce values of $D_0$ and Q which are of no particular theoretical
significance and both less than the true values for lattice diffusion,
which it is often assumed they represent. In fact the results for high
temperatures in Fig. 6, together with those at low temperatures for
polycrystalline material, lie quite closely to a straight line with equation

$$D = 8 \times 10^{-?} \exp(-38{,}000/RT)$$

$D_0$ is very appreciably less than, and Q lies between, the true values for
lattice and for grain boundary diffusion.
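The effect of an uncritical straight line can be illustrated numerically. The sketch below generates points from equation (8) at high temperatures and from equation (9) at low temperatures, fits a single line to ln D against 1/T by least squares, and reports the resulting D_0 and Q. The choice of temperatures and the use of the fitted equations in place of the original measurements are assumptions made for illustration, so the numbers need not reproduce the line quoted above.

```python
import math

R_GAS = 1.987  # gas constant, cal/(mole K)


def D_lattice(T_kelvin):
    """Lattice self-diffusion in silver, equation (8)."""
    return 0.895 * math.exp(-45950.0 / (R_GAS * T_kelvin))


def D_apparent(T_kelvin):
    """Apparent low temperature polycrystalline coefficient, equation (9)."""
    return 2.3e-5 * math.exp(-26400.0 / (R_GAS * T_kelvin))


def arrhenius_fit(temps_K, Ds):
    """Least-squares line through ln D against 1/T; returns (D0, Q)."""
    xs = [1.0 / T for T in temps_K]
    ys = [math.log(D) for D in Ds]
    n = len(xs)
    xbar, ybar = sum(xs) / n, sum(ys) / n
    sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))
    sxx = sum((x - xbar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = ybar - slope * xbar
    return math.exp(intercept), -slope * R_GAS


# Assumed sampling temperatures, deg C (illustrative only).
high_T = [c + 273.15 for c in (750, 800, 850, 900)]   # lattice-controlled
low_T = [c + 273.15 for c in (500, 550, 600, 650)]    # boundary-affected

temps = high_T + low_T
Ds = [D_lattice(T) for T in high_T] + [D_apparent(T) for T in low_T]

D0_fit, Q_fit = arrhenius_fit(temps, Ds)
print(f"D0 = {D0_fit:.3g} cm^2/sec, Q = {Q_fit:.0f} cal/mole")
```

The fitted Q falls between the activation energies of equations (8) and (9), and the fitted D_0 is well below the lattice value of 0.895 cm²/sec, which is the behaviour described in the text.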

HOFFMAN and TURNBULL took a contact autoradiograph of a taper
section of a diffusion sample, cut in the diffusion zone and at a small
angle to the plated surface. Blackening was strongest along a series of
lines which corresponded exactly with the traces of the grain boundaries
revealed by etching the same sample, and the lines were thicker on that side
of the section nearer the plated surface. This indicates, independently
of FISHER's analysis, that $D_{gb}$ is, in fact, greater than $D_l$ for self-diffusion
in silver. The same experiment has been performed with tin by
FENSHAM, with similar results.

Method II (p. 281) has been used in the past only to reveal qualitatively
any difference between $D_{gb}$ and $D_l$. By suitable choice of etch,
or by use of contact autoradiography in grain boundary self-diffusion
studies, it may yield results from which concentration contours as in
Fig. 5 could be obtained. Their shape at distances along the grain
boundary greater than the maximum depth of penetration by lattice
diffusion alone is governed by equation (3) and can, therefore, be used
to test this equation and to derive values of ($D_{gb}\delta$).

LE CLAIRE pointed out that simple differentiation of equation (3)
shows the angle $\phi$ at which a concentration contour meets the grain
boundary to be simply related to $D_{gb}\delta$, independently of concentration,
by the equation

$$D_{gb}\delta / D_l = 2(\pi D_l t)^{1/2} \cot^2 \phi \qquad (10)$$


This is derived directly from equation (3) without further assumptions,
so that observations on this one simple geometrical feature of concentration
contours can be used to check FISHER's equation. If it is valid, then
when $D_{gb}$ and $D_l$ are assumed independent of concentration, $\phi$ is the
same whatever the concentration at the contour, and for observations
on the same grain boundary after successive anneals, $\tan^4 \phi$ should be
proportional to t.
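Equation (10) and the proportionality of tan⁴ φ to t can be checked by differentiating the concentration field numerically. The sketch below assumes that equation (3) has the form C = C_0 exp(−ky) erfc[x/2(D_l t)^{1/2}], with k the coefficient of y appearing in equation (6), and that φ is the angle between the contour and the boundary at x = 0. The numerical values and function names are illustrative only.

```python
import math


def fisher_concentration(x, y, t, D_l, gb_product, C0=1.0):
    """Assumed form of equation (3); gb_product is D_gb * delta."""
    k = math.sqrt(2.0) / ((math.pi * D_l * t) ** 0.25 * math.sqrt(gb_product / D_l))
    return C0 * math.exp(-k * y) * math.erfc(x / (2.0 * math.sqrt(D_l * t)))


def tan_phi(y, t, D_l, gb_product, h=1.0e-6):
    """tan of the angle between a contour and the boundary at x = 0."""
    def c(x, yy):
        return fisher_concentration(x, yy, t, D_l, gb_product)

    dC_dx = (c(h, y) - c(0.0, y)) / h                  # one-sided, x >= 0 only
    dC_dy = (c(0.0, y + h) - c(0.0, y - h)) / (2.0 * h)
    return abs(dC_dy / dC_dx)                          # |dx/dy| along the contour


def gb_product_from_angle(tan_value, t, D_l):
    """Equation (10): D_gb*delta = 2 D_l (pi D_l t)^(1/2) cot^2(phi)."""
    return 2.0 * D_l * math.sqrt(math.pi * D_l * t) / tan_value ** 2


# Illustrative values in cm and sec.
D_l, gb_product, y = 1.0e-8, 5.0e-10, 0.05
t1, t2 = 1.0e6, 4.0e6

recovered = gb_product_from_angle(tan_phi(y, t1, D_l, gb_product), t1, D_l)
tan4_ratio = (tan_phi(y, t2, D_l, gb_product) / tan_phi(y, t1, D_l, gb_product)) ** 4

print(f"D_gb*delta recovered from the angle: {recovered:.3e}")
print(f"tan^4(phi) ratio for t2/t1 = 4: {tan4_ratio:.4f}")
```

Under these assumptions the angle returns the product D_gb δ that was put in, and quadrupling the annealing time quadruples tan⁴ φ, independently of the depth y at which the contour is examined.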

It also provides a quick and convenient method for determining $D_{gb}$
and its variation with temperature and orientation. BARNES has used
method II to show that copper diffuses more rapidly in the grain
boundaries than in the grains of nickel; the etchant used was such
as to reveal several concentration contours.

Application of equation (10) showed that at 1000°C for this system

$$D_{gb}/D_l \sim [?] \times 10^5$$

The depth of penetration by grain boundary diffusion in this system
is also greater than that by lattice diffusion by an amount which
decreases markedly with increase in temperature, suggesting that $Q_{gb}$
is again less than $Q_l$.



Orientation Dependence of Grain Boundary Diffusion 


BECK, SPERRY and HU have suggested that their observations, that
grain boundary migration during recrystallization is more rapid the
larger the disorientation between the grains flanking the boundary,
may be explained by assuming that the rate of diffusion of atoms within
the grain boundary varies with orientation. But the first direct study
of the variation of grain boundary diffusion rates with relative grain
orientation has been made recently by ACHTER and SMOLUCHOWSKI,
for silver diffusing in copper. BERMAN and HARRINGTON had previously
shown qualitatively for this system that silver diffused into copper
more rapidly in the grain boundaries than in the adjoining grains.

Fig. 7. Extent of Diffusion X vs. Orientation Difference (141 hr at 725°C), showing the Extent of Volume Diffusion and the Extent of Grain Boundary Diffusion

For convenience in interpretation of the results, ACHTER and
SMOLUCHOWSKI used specimens of “columnar” copper, in which all
grains had approximately a common cubic direction and their grain
boundaries parallel to this direction. Two co-ordinates completely
specify the configuration of each grain boundary in such a sample,
rather than the five required in the more general case. The relative
orientation of the two grains flanking the boundary is given by the
angle $\theta$ between a pair of cube axes normal to the columnar direction,
and the direction of the grain boundary by the angle $\phi$ between one of
the cube axes and the plane of the boundary. Diffusion couples of Cu
and Cu + 4 per cent Ag were welded together with the columnar
direction in the copper normal to the interface, so that all grain
boundaries were parallel to the direction of diffusion of the silver. After
annealing, samples were quenched to retain the silver in solid solution.
The maximum depth of penetration of silver down the grain boundaries 
was determined by etching consecutive sections lapped parallel to the 
interface and noting at what depth from it for each grain boundary 
silver, as revealed by the etchant, could no longer be detected. The 
amount of silver in each grain boundary decreased with increasing
distance from the interface and the maximum depth of penetration was
found to vary with the relative orientation $\theta$, as determined by etch pit
methods, in the interesting manner shown in Fig. 7. The appearance
of a typical etched section is shown in Fig. 8. At angles $\theta$ < 20° and
> 70° the silver does not penetrate along the grain boundaries any
further than it does into the grains by lattice diffusion alone, but for
$\theta$ > about 20° the grain boundary penetration depth exceeds the lattice
penetration by an amount which increases as $\theta$ increases from 20°,
reaches a maximum at 45° and then falls to zero again at 70°. The
results also suggested, but only vaguely, that the preferential grain
boundary penetration may be least where the grain boundary is
symmetrical with respect to the lattices of the adjoining grains, so that
$\phi = \theta/2$.

Fig. 8. Columnar Diffusion Sample cut parallel to Interface

It would appear then that grain boundary diffusion rates are not
simply or directly related to the grain boundary energy, since READ
and SHOCKLEY have shown the energy to increase continuously as $\theta$
increases from zero up to about 20°.

The results were based upon observations made on pairs of grains
which deviated from the perfect columnar structure by less than 7-8°,
but there were many pairs with an angle $\psi$, between those (100)
directions most nearly parallel to the general columnar or diffusion
direction, which was appreciably greater than 8°. At a junction of three
grains in a section perpendicular to the diffusion direction it often
appeared that silver at a silver rich boundary had diffused perpendicularly
to the columnar direction a little way into a silver free boundary, as
at (a) in Fig. 8. Such situations enabled a comparison to be made of
diffusion rates in two mutually perpendicular directions in one grain
boundary. It appeared that even when $\theta$ was less than 20°, so that no
preferential grain boundary diffusion occurred along the columnar
direction, an appreciable diffusion perpendicular to the columnar
direction in the same boundary could occur provided $\psi$ was greater
than 20°.

These results reveal not only a dependence of $D_{gb}$ upon the relative
orientation of the grains but also, for fixed orientation, upon the
direction within the grain boundary, i.e. an anisotropy of grain boundary
diffusion.

In an earlier series of experiments using a slightly different method,
ACHTER and SMOLUCHOWSKI estimated the activation energy for
grain boundary diffusion of Ag in Cu to be about 23,800 cal/mole, compared
with the lattice diffusion activation energy, estimated at the same time,
of about 38,300 cal/mole.

Quite recently all this work has been criticized by DAWSON, who
suggests that it may only be a study of grain boundary precipitation
as a function of solute supersaturation and orientation difference. He
supposes that the silver diffuses into the copper everywhere by normal
lattice diffusion and that on cooling the grain boundaries merely act
as preferred sites for precipitate nucleation. However, ACHTER and
SMOLUCHOWSKI point out that the manner in which the silver is
distributed in and around the grain boundaries is inconsistent with this
viewpoint, as also are the observed anisotropy and low activation energy.
Even more recently FLANAGAN and SMOLUCHOWSKI have extended
this work to the diffusion of zinc in copper, a system free from precipitation
effects but in which a variation of grain boundary penetration
with orientation was observed very similar in nature to that of the
silver-copper system. DAWSON's criticisms seem then unfounded. A grain
boundary activation energy of 24,500 cal/mole has been estimated
for Zn diffusing in Cu compared to a lattice diffusion activation energy
of 34,000 cal/mole.


Grain Boundary Models and Diffusion 


Many of the above results have been discussed by ACHTER, FLANAGAN
and SMOLUCHOWSKI in terms of some of the various models of a
grain boundary that have been proposed by other workers (see PMP.1,
Chapter 3). If we suppose the boundary to consist of a suitable array of 
edge dislocations then an inference from the anisotropy observations is 
that diffusion is most rapid in a grain boundary along the lines of the 
edge dislocations which compose it provided there is a difference in 
orientation between adjacent grains of more than 20°, but no faster than 
ordinary lattice diffusion in a direction perpendicular to the dislocation 
lines. For when $\psi$ is small and $\theta$ > 20° the dislocation lines run most
nearly along the columnar direction, in which the grain boundary diffusion
is rapid, but when $\psi$ > 20° and $\theta$ is small, as in some pairs of grains
deviating considerably from the columnar structure, dislocation lines 
run most nearly perpendicular to the general columnar direction and it 
is only in this direction now that grain boundary diffusion is rapid. 
That a disorientation of as large as 20° is necessary before any prefer- 
ential grain boundary diffusion becomes apparent, strongly suggests 
that the mobility of atoms along individual edge dislocations is not 
appreciably different from their normal lattice mobility, for provided the 
angle is less than about 20° the dislocations composing the grain boun- 
dary are still individually distinguishable and separate from one 
another. But the angle of 20° at which the grain boundary rate begins 
to be appreciably greater than that of lattice diffusion, is just in the 
range where the distance between dislocations becomes so small that 
their individuality is lost ; the grain boundary is now better represented 
by such a model as that of MOTT, who pictures it as consisting of
islands of good lattice fit surrounded by regions of misfit, which we may 
regard perhaps as formed by the grouping together of now individually 
indistinguishable dislocations. This change in structure of the grain 
boundary as $\theta$, increasing, passes through a value of about 20° has been
demonstrated by LOMER and NYE with the BRAGG bubble model, and
appears to be made manifest by the grain boundary diffusion behaviour. 
While the isolated regions of misfit described as single dislocations provide
no special facilities for enhanced grain boundary diffusion, perhaps because
the void space associated with them is too small to play the role
of a vacancy and permit passage of a diffusing atom (so that we should not
expect such sub-structural features of single crystals as mosaic boundaries
to influence diffusion rates), the more extensive regions of misfit,
which they combine to produce at larger angles, do allow more rapid
diffusion, perhaps because of their increased void space. Twin boundaries
are regions of good lattice fit even if the relative orientation of the twin 
grains is large, and accordingly it is found that the diffusion rate in 
them is less than corresponds to the relative orientation, as shown by 
the cusp in Fig. 7 at angles corresponding to twinning on {210} planes. 
BARNES found no evidence for any increased diffusion of copper at
the presumably {111} twin boundaries of nickel. The islands of good
fit will be elongated, and the areas of misfit, therefore, less interrupted,
along the same direction as that of the dislocation lines at lower angles 
of disorientation, so that grain boundary diffusion will be preferred 
along this direction and the anisotropy as observed at angles greater 
than 20° is still explicable. 

It will be clear that FISHER's assumption (2), and therefore his equation
(3), is most nearly valid for large angle boundaries where the
regions of misfit will be extensive flat areas covering a large proportion,
if not all, of the grain boundary. Such boundaries are believed to have
been predominant in the polycrystalline silver used by HOFFMAN and
TURNBULL in the experiments described above. At smaller angles the
regions of misfit will be less extensive, and for $\theta$ only a little greater
than ~ 20° may be regarded as rods or channels of distorted structure,
separated by relatively undistorted areas. $D_{gb}$ and $Q_{gb}$ are now to
be taken as referring to diffusion in the distorted misfit regions only of
a grain boundary. Now the maximum depth to which material penetrates
by diffusion down a grain boundary is determined both by $D_{gb}$
and by the rate, which depends on $D_l$, at which material is lost from the
boundary into the adjoining grains. For low angle grain boundaries
this loss of material, being in all radial directions from each of the rod
shaped regions of misfit, will, relative to the flux of material down the
misfit regions themselves, be greater than for high angle grain boundaries
where the loss is principally by planar diffusion from the extensive
flat areas of misfit. The maximum depth of penetration, for this reason
alone, will therefore increase with increasing values of $\theta$, above a certain
value, as shown in Fig. 7.

Since $Q_{gb} < Q_l$, the rate of loss of material from the boundary
increases more rapidly with temperature than the rate of flow down the
boundary itself. Thus the apparent activation energy, calculated from
measurements of the variation with temperature of the maximum depth
of penetration, will be less than the true activation energy $Q_{gb}$ and will
vary with the structure of the boundary, i.e. with the angle $\theta$, being
lowest and possibly even negative for the lower angle boundaries
(~ 20°), where the loss into the grains is relatively more important.
For $\theta$ < ~ 20°, of course, where there is no preferential grain boundary
diffusion, the apparent activation energy equals $Q_l$.

FLANAGAN and SMOLUCHOWSKI have recently studied the temperature
dependence of the grain boundary diffusion of zinc in copper and
have found an orientation dependence for the apparent activation
energy of just the type expected from the above described model.
Taking into account the changing structure of the grain boundary with
$\theta$, SMOLUCHOWSKI has presented a theoretical discussion of grain
boundary diffusion, with particular reference to the apparent activation
energy, and obtained values for this which vary with $\theta$ in a manner
closely agreeing with that observed in the zinc in copper experiments.

Little consideration has yet been given to the precise mechanism 
of diffusion in a grain boundary. Although the tendency is to discuss 
this in terms of the vacancy mechanism, the distinction between the 
various mechanisms which have been proposed for lattice diffusion is 
much less clear in the disorganized regions of a grain boundary; it 
may be quite impossible for example to distinguish in the boundary 
between an interstitial position and a vacant lattice site. The
observed anisotropy and the nature of the variation of $D_{gb}$ with orientation
indicate that the increase in diffusion rates at grain boundaries
arises mainly from the effect their changed structure has on the mobility 
of the atoms composing them rather than that it is principally due to the 
grain boundaries providing a higher concentration in their neighbour- 
hood than elsewhere of, say, vacancies, and so increasing diffusion rates 
over a region on either side of the grain boundary more extensive than 
that over which the lattice is irregular. In the latter case $\delta$ would be
appreciably greater than the rough value of $5 \times 10^{-8}$ cm usually assumed
for it.

It was at one time suggested that the same unit atomic process was
responsible for grain boundary slip, for constant rate creep in single
crystals and for volume diffusion because their activation energies
appeared to be the same, at least for α-Fe, α-brass and Al. The unit
process considered was an exchange or re-arrangement of atoms in what
KÊ described as ‘disordered groups’ of atoms. These exist, he
supposed, in certain concentration within the grains and in much higher
concentration where they formed grain boundaries. Since they would
also give rise to grain boundary diffusion, the inference is drawn that
$Q_{gb}$ should also be equal to the activation energy for the three above-mentioned
processes. Of particular interest is the predicted equality of
$Q_{gb}$ and $Q_l$. But apart from the fact that experiment shows $Q_{gb}$ to be
always quite appreciably less than $Q_l$, a recent and more accurate
redetermination of the activation energy for self-diffusion in iron and
a study of self-diffusion and of grain boundary slip in tin, show that
the activation energies for KÊ's three processes are certainly not equal
for tin and iron, the activation energies for self-diffusion being appreciably
less than those for grain boundary slip and for creep. These
processes are not then all intimately related to one another through a
common unit process, or to grain boundary diffusion, and there is no
reason for expecting $Q_{gb}$ to equal $Q_l$. Should they be equal then
differences between $D_{gb}$ and $D_l$ would reside wholly in the constant
term in the Arrhenius equation; a grain boundary model which would
then explain the observed differences has been discussed by LE CLAIRE.

It has often been suggested that impurities in metals may have a
tendency to segregate in grain boundaries and the first convincing
demonstration of a case where this is so has recently been given: the
case of Po impurities in PbBi alloys. The polonium impurity was
produced by decay of radioactive Bi incorporated in the alloy and
although present at as low a concentration as 1 part in $10^{?}$, was shown
by autoradiography to be present in the alloy after annealing in much
higher concentration at the grain boundaries than elsewhere. Impurities
which do segregate in this way will most probably have an effect on
grain boundary processes out of all proportion to their total concentration.
Thus KÊ found that very small amounts of impurity completely
block grain boundary slip in aluminium, although HOFFMAN and
TURNBULL found little change with purity in the observed grain
boundary self-diffusion rates in silver. BIRCHENALL and MEHL,
however, found that while at high temperatures the self-diffusion
coefficients in polycrystalline samples of iron of various purities were
substantially the same, at low temperatures the coefficient was greater in
the iron of commercial purity than in more pure material. They suggest that
perhaps in a pure metal $D_{gb}$ and $D_l$ are normally the same but that
impurities accumulating in grain boundaries have the effect, at least in
commercial iron, of increasing $D_{gb}$ and so also the apparent low temperature
self-diffusion coefficient. But these results are not conclusive
because they had no quantitative evidence on the relative grain sizes
of the various purity materials; the apparently smaller grain size in the
less pure iron, if in fact sufficiently smaller than that of the pure iron,
would equally well explain the results.

It would be surprising if impurities alone were responsible for the 
increase in grain boundary diffusion over lattice diffusion rates but their 
importance in affecting values of $D_{gb}$ is a matter which merits further
and more extensive investigation. 

Surface Diffusion 
A number of interesting experiments have demonstrated the very rapid
mobility of surface metal atoms and summarizing reviews of these
are already available in the literature. In a very few cases quantitative
estimates of $D_s$ and of $Q_s$ have been made which show $Q_s$ to be less than
both $Q_{gb}$ and $Q_l$, and $D_s$ to be greater than $D_{gb}$ and $D_l$, which from the
nature of the process is not unexpected. In the last few years some
further contributions have been made in this field which generally
confirm these results.
It will be clear that FISHER's analysis of grain boundary diffusion
is, with similar assumptions, directly applicable to surface diffusion if
in his equations $D_{gb}$ is replaced by $D_s$ and $\delta$ by $2\delta_s$, $\delta_s$ being the effective
width of the diffusing surface layer. The factor 2 arises because material
is, of course, only lost by lateral diffusion into the grain from one side
of the surface layer. But no use yet appears to have been made of this
analysis in surface diffusion experiments.
NICKERSON and PARKER have made measurements on the surface
self-diffusion of silver and obtained the result

$$D_s = 0.16 \exp(-10{,}300/RT) \ \text{cm}^2/\text{sec} \qquad (11)$$


Radioactive silver was deposited at one small region of a bundle of
silver wires and the surface diffusion coefficient calculated from the 
distribution along the wire of the active silver after annealing at various 
temperatures between 225 and 350°C. To measure the distribution the 
wire was scanned with a counter tube fitted with a lead collimator and 
thin slit so that at each position the activity of only a small section of 
the wire was measured. However, the results were analysed without 
regard to loss of material from the surface into the grains: the wires 
used were, in the absence of any indication to the contrary, presumably 
polycrystalline so that surface atoms would also be lost into grain 
boundaries. For these reasons and in view of the considerable scatter 
among the results, equation (11) represents only a rough estimate of $D_s$.
However, taken with HOFFMAN and TURNBULL's results we now have
a more complete account of self-diffusion processes in Ag than in any 
other metal. 
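The three expressions now available for silver may be compared at a common temperature. The sketch below evaluates equations (11), (7) and (8) at 300°C, assuming R = 1.987 cal/(mole K). This temperature lies within the range of the surface measurements only, so the grain boundary and lattice values are extrapolations, and the figures should be read as orders of magnitude; the names used are illustrative.

```python
import math

R_GAS = 1.987  # gas constant, cal/(mole K)

# (D0 in cm^2/sec, Q in cal/mole) for self-diffusion in silver.
SILVER = {
    "surface, eq. (11)": (0.16, 10300.0),
    "grain boundary, eq. (7)": (0.03, 20200.0),
    "lattice, eq. (8)": (0.895, 45950.0),
}


def coefficient(D0, Q, T_kelvin):
    """D = D0 exp(-Q/RT)."""
    return D0 * math.exp(-Q / (R_GAS * T_kelvin))


T = 300.0 + 273.15  # assumed comparison temperature, kelvin
values = {name: coefficient(D0, Q, T) for name, (D0, Q) in SILVER.items()}

for name, D in values.items():
    print(f"{name}: D = {D:.2e} cm^2/sec")
```

The ordering D_s > D_gb > D_l obtained here follows the ordering of the activation energies, subject to the caution already expressed about the accuracy of equation (11).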

A similar method has been used by FRAUENFELDER to study the
surface diffusion of copper (Cu$^{?}$) on silver, with the single result that at
750°C, $D_s = 7 \times 10^{-?}$ cm²/sec.

MÜLLER has used an ingenious method to study the surface self-diffusion
of tungsten. The potential difference required to maintain a
given electron emission from a fine tungsten point is a function of its
radius of curvature. Changes in shape occur when the point is maintained
at a high temperature (1600 to 2500°C) and, assuming such
changes are due to surface diffusion, MÜLLER has calculated from the
change in potential with time that the activation energy for surface
self-diffusion of tungsten is 106,500 cal/mole. This is appreciably less
than the value of 143,000 cal/mole for lattice self-diffusion estimated by
VAN LIEMPT. It is interesting to note that the ratio $Q_s/Q_l$ is much
higher for tungsten than for silver but it should be remembered that
VAN LIEMPT's estimate is not from direct radioactive measurements.
He merely assumes that his values of the coefficient for diffusion of iron
in tungsten, being measured at very low iron concentration, represent
also the self-diffusion coefficient of tungsten. His value for $Q_l$ may then
be an underestimate of the true value.
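The comparison of the ratio Q_s/Q_l for the two metals is a matter of simple arithmetic, set out below. The silver values are those of equations (11) and (8); the tungsten lattice value is taken as 143,000 cal/mole, which is an assumption about the units of VAN LIEMPT's figure. The dictionary layout is illustrative.

```python
# Activation energies in cal/mole.
activation = {
    "silver": {"Q_s": 10300.0, "Q_l": 45950.0},       # equations (11) and (8)
    "tungsten": {"Q_s": 106500.0, "Q_l": 143000.0},   # Q_l value assumed
}

ratios = {metal: q["Q_s"] / q["Q_l"] for metal, q in activation.items()}

for metal, ratio in ratios.items():
    print(f"{metal}: Q_s/Q_l = {ratio:.2f}")
```

On these figures the ratio is about 0.22 for silver and about 0.74 for tungsten; a larger true value of Q_l for tungsten would reduce the second ratio.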


SECTION III


THE MECHANISM OF DIFFUSION 


1. Diffusion in Interstitial Solid Solutions 

It has never been doubted that the diffusion of solute atoms in inter- 
stitial solid solutions occurs by successive jumps of the solute atoms 
from one interstitial position to another, for it is difficult to imagine 
any alternative mechanism in this case. 

Recently there have become available new methods for measuring 
interstitial diffusion coefficients (see Section IV) which extend the 
temperature range over which such measurements may be made and so 
provide more accurate values than have before been available of the 
theoretically important quantities $D_0$ and Q in the usual equation
employed to express the temperature dependence of D, viz.

$$D = D_0 \exp(-Q/RT) \qquad (12)$$

Prompted by this important development, renewed attention has
recently been given, notably by WERT and ZENER, to a theoretical
interpretation of $D_0$.

Treating the process of diffusion as a random walk problem, it may
readily be shown that for a cubic lattice the diffusion coefficient D for
any one species of atom is given in terms of $\tau$, the average time between
successive jumps of each atom of that species, by the expression

$$D = \alpha a^2/\tau \qquad (13)$$

where a is the lattice constant and $\alpha$ is a geometric factor depending
on the disposition round any one equilibrium site of neighbouring sites
into which the atom may diffuse by jumping. $\alpha$ is easily computed and
turns out, for interstitial diffusion, to be 1/24 for a b.c.c. and 1/12 for a
f.c.c. lattice. The remaining problem is to determine $1/\tau$.
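The values of the geometric factor may be recovered from the standard result for an unbiased random walk in three dimensions, D = λ²/6τ, where λ is the jump length. The sketch below assumes jumps between neighbouring octahedral interstitial sites, of length a/2 in the b.c.c. lattice and a/√2 in the f.c.c. lattice; these jump lengths and the function name are assumptions introduced for the illustration.

```python
from fractions import Fraction


def alpha(jump_length_squared_over_a2):
    """alpha in D = alpha a^2 / tau, from D = lambda^2 / (6 tau).

    The argument is (lambda / a)^2, the squared jump length in units of a^2.
    """
    return Fraction(jump_length_squared_over_a2) / 6


# Assumed jump geometry between neighbouring octahedral interstitial sites.
alpha_bcc = alpha(Fraction(1, 4))   # jump length a/2
alpha_fcc = alpha(Fraction(1, 2))   # jump length a/sqrt(2)

print(f"b.c.c.: alpha = {alpha_bcc}")
print(f"f.c.c.: alpha = {alpha_fcc}")
```

Exact fractions are used so that the results can be compared directly with the values 1/24 and 1/12 given above.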

WERT, and WERT and ZENER, have applied for this purpose the
essential assumptions and methods of EYRING's theory of reaction
rates but have adapted it to this particular problem more carefully
than previous workers. The most general expression for the rate of
an activated process is, according to EYRING

$$1/\tau = p\,(kT/h)\,F^*/F$$

where k, T and h have their usual meaning and p is the number of
equivalent paths for diffusion of an atom from any one position of
equilibrium (for interstitial diffusion p is 4 in b.c.c. and 12 in f.c.c.
lattices). F and F* are the partition functions for the system in its
initial and activated states respectively, i.e. for an atom in its normal
equilibrium position and at the top of the potential barrier separating
it from the next position to which it is diffusing. Now in evaluating F*,
the degree of freedom at the saddle point corresponding to movement
over the barrier is omitted. F*/F is usually written as $\exp(-\Delta G'/RT)$,
where $\Delta G'$ is referred to as the ‘free energy’ change in passing from
the equilibrium to the activated state. But this quantity is not strictly
a free energy for it is derived from the ratio of two partition functions with
unequal numbers of degrees of freedom. In order to obtain a similar ratio of
partition functions with equal numbers of degrees of freedom WERT
and ZENER write


$$F = F_\nu \cdot F_1$$

where $F_\nu$ is the partition function for a single linear oscillator of 
frequency $\nu$, this being considered to represent the vibrational motion 
of the atom about its equilibrium position and along the direction of 
impending diffusion. $F_1$ is then the partition function for the system 
with the atom constrained to move in a plane normal to the direction 
of diffusion and passing through the equilibrium position. For the 
temperatures at which diffusion measurements are made it is quite 
sufficient to approximate the usual expression for $F_\nu$ by $F_\nu = kT/h\nu$, 
so that we arrive at the result 

$$1/\tau = p\nu\,F^*/F_1$$

Since $F^*$ and $F_1$ now refer to two states with equal numbers of 
degrees of freedom, we may, in writing 

$$1/\tau = p\nu \exp(-\Delta G/RT) \qquad (14)$$

legitimately refer to $\Delta G$ as a free energy change and so equate it to the 
isothermal work done in moving an atom from an equilibrium position 
to the top of the potential barrier separating it from the next equilibrium 
position, the atom only being constrained during this motion to move 
in directions normal to the line joining the two equilibrium positions. $\Delta G$ 
in (14) has therefore a clear physical significance. 
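The adequacy of the high-temperature form $kT/h\nu$ for the oscillator partition function can be gauged numerically. The sketch below compares it with the quantum expression $1/[2\sinh(h\nu/2kT)]$, in which the energy zero is taken at the bottom of the potential well; that choice of energy zero, the frequency of $10^{13}$ sec⁻¹ and the temperature of 1000 K are assumptions made only for illustration.

```python
import math

H_PLANCK = 6.62607e-34   # J s
K_BOLTZ = 1.380649e-23   # J/K

def oscillator_partition_exact(nu_hz, temp_K):
    """Quantum partition function of one linear oscillator, zero-point energy included."""
    x = H_PLANCK * nu_hz / (K_BOLTZ * temp_K)
    return 1.0 / (2.0 * math.sinh(x / 2.0))

def oscillator_partition_classical(nu_hz, temp_K):
    """High-temperature limit kT/(h nu)."""
    return K_BOLTZ * temp_K / (H_PLANCK * nu_hz)

nu = 1.0e13   # assumed vibration frequency, 1/sec
T = 1000.0    # assumed temperature, K
exact = oscillator_partition_exact(nu, T)
approx = oscillator_partition_classical(nu, T)
relative_error = (approx - exact) / exact
```

With these assumed inputs the two expressions differ by about one per cent, which is small compared with the usual uncertainty in measured diffusion coefficients.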

A derivation of equation (14) from first principles has been given 
by WERT. 

Assuming the isothermal work to be done at constant pressure we 
may make use of the thermodynamic relation 

$$\Delta G = \Delta H - T\Delta S$$

in writing, from equations (13) and (14), the expression for $D$, 

$$D = p\alpha a^2\nu \exp(\Delta S/R) \exp(-\Delta H/RT) \qquad (15)$$

$\Delta S$ and $\Delta H$ are referred to respectively as the “entropy” and “enthalpy” 
(or heat) of activation. 

We also have the additional relation 

$$\partial \Delta H/\partial T = T\,\partial \Delta S/\partial T$$

so it follows that 

$$\partial \log D/\partial (1/T) = -\Delta H/R$$

and hence that $\Delta H$ in equation (15) corresponds with $Q$ in equation (12). 
$D_0$ is, therefore, given by the relation 

$$D_0 = p\alpha a^2\nu \exp(\Delta S/R) \qquad (16)$$


WERT and ZENER estimate $\nu$ by assuming the potential energy of a 
solute atom to vary sinusoidally as it moves from one interstitial lattice 
position to another with an amplitude taken as equal to the heat of 
activation $\Delta H$. With such an assumption they obtain 

$$\nu = (\Delta H/2ml^2)^{1/2}$$

where $l$ is the distance between interstitial positions and $m$ is the mass 
of the atom. 
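To give a feeling for the magnitude of $\nu$ obtained from this expression, the sketch below evaluates it for a light interstitial atom. The inputs (a heat of activation of 20 kcal/mole, an atomic mass of 12 and a jump distance of 1.43 Å) are assumed illustrative values and are not quoted from the text; the function name `jump_frequency` is likewise hypothetical.

```python
import math

AVOGADRO = 6.02214e23       # 1/mol
JOULE_PER_KCAL = 4184.0
AMU_KG = 1.66054e-27

def jump_frequency(delta_h_kcal_per_mole, mass_amu, jump_distance_m):
    """nu = sqrt(dH / (2 m l^2)), with dH converted to joules per atom."""
    dh_joule = delta_h_kcal_per_mole * JOULE_PER_KCAL / AVOGADRO
    mass_kg = mass_amu * AMU_KG
    return math.sqrt(dh_joule / (2.0 * mass_kg * jump_distance_m ** 2))

# Assumed illustrative inputs, not data from the text
nu_example = jump_frequency(20.0, 12.0, 1.43e-10)
```

The result is of the order of $10^{13}$ sec⁻¹, the magnitude normally associated with atomic vibrations in solids.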

The only unknown quantity now appearing in the expression for $D_0$ 
is $\Delta S$. This may be evaluated by noting the further thermodynamic 
relation 

$$\Delta S = -\partial \Delta G/\partial T$$

If we assume that all the work $\Delta G$ appears as strain energy in the 
lattice surrounding the diffusing atom when at the saddle point, then 
we may put 

$$\Delta G = \tfrac{1}{2}\mu e^2 v \qquad (17)$$

where $\mu$ is some shear modulus of the lattice, $e$ a measure of the shear 
strain, and $v$ the volume over which the strain is apparent. The 
variation with temperature of that part of the experimentally observed 
modulus which is associated only with lattice elasticity, and which does 
not contain a contribution from grain boundary relaxation processes, is 
found to be quite closely linear. Therefore, we may write 

$$\mu = \mu_0 + T\,d\mu/dT \qquad (18)$$

and if we ignore any possibly small temperature variations of $v$ and $e$, 
we find 

$$\Delta H = \tfrac{1}{2}\mu_0 e^2 v$$

and 

$$\Delta S = -\Delta H\,\partial(\mu/\mu_0)/\partial T \qquad (19)$$

$\partial\mu/\partial T$ is always found to be negative so that $\Delta S$ is essentially positive 
and $\exp(\Delta S/R)$ always greater than one. 
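The connexion between the temperature coefficient of the modulus and the entropy of activation may be illustrated numerically. The sketch below takes equation (19) in the form $\Delta S = -\Delta H\,\partial(\mu/\mu_0)/\partial T$, which follows from (17) and (18) when $e$ and $v$ are independent of temperature, and checks that the resulting $\Delta G$ satisfies $\Delta G = \Delta H - T\Delta S$. The numerical inputs are assumed values chosen only for illustration.

```python
import math

R_CAL = 1.987  # cal/(mol K)

def activation_entropy(delta_h_cal, dmu_dT_over_mu0):
    """dS = -dH * d(mu/mu0)/dT, the form of equation (19) assumed here."""
    return -delta_h_cal * dmu_dT_over_mu0

def free_energy(delta_h_cal, dmu_dT_over_mu0, temp_K):
    """dG = dH * (mu/mu0), with mu/mu0 = 1 + T * d(mu/mu0)/dT from (17) and (18)."""
    return delta_h_cal * (1.0 + temp_K * dmu_dT_over_mu0)

dH = 20000.0      # assumed heat of activation, cal/mole
slope = -2.0e-4   # assumed d(mu/mu0)/dT, per K (negative, as observed for metals)
dS = activation_entropy(dH, slope)
dG_600 = free_energy(dH, slope, 600.0)
entropy_factor = math.exp(dS / R_CAL)
```

With a negative modulus coefficient the entropy factor exceeds unity, so that $D_0$ is larger than the purely kinetic estimate $p\alpha a^2\nu$.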
Inserting the known or now estimated values of $p$, $\alpha$, $\nu$ and $a^2$ into 
equation (16), values of $\Delta S_{\mathrm{exp}}$ can be calculated from experimental 
values of $D_0$ and compared with the values of $\Delta S_{\mathrm{theor}}$ given by equation 
(19). For this latter calculation WERT has used the temperature 
coefficients of the polycrystalline bulk shear moduli as deduced from 
KÖSTER's measurements for a large number of materials of the 
temperature dependence of Young's moduli. A comparison of theory 
with experiment has been made by WERT by plotting $\Delta S_{\mathrm{exp}}$ against 
$\Delta S_{\mathrm{theor}}$ as in Fig. 9. Full agreement with the foregoing theory is 
indicated for those points which lie on the line of slope one. The most 
accurate experiments to date are those for C in $\alpha$-Fe in view of the 
very extensive temperature range (−40 to 800°C) over which they 
have been made, and in this case the agreement is excellent. WERT 
points out that both for this and other cases, repetition of measurements 
by different observers appears to lead to continuously improving 
agreement between experiment and theory and so by implication 
attributes remaining discrepancies to there still being room for improvements 
in the accuracy of $D_0$. The arrows in Fig. 9 run from earlier to 
later experiments. 

Fig. 9. Theoretical and Experimental Values of $\Delta S/R$ for Interstitial 
Diffusion 
Oxygen in Ta: KÊ (1948) 
Nitrogen in Ta: KÊ (1948) 
Carbon in Ta: KÊ (1948), WERT (1950) 
Carbon in $\alpha$-Fe: SNOEK (1941), STANLEY (1949), WERT and ZENER (1949), WERT (1950) 
Nitrogen in $\alpha$-Fe: SNOEK (1941), WERT and ZENER (1949), WERT (1950) 

In an earlier paper, WERT and ZENER consider the possibility that 
not all the energy $\Delta G$ may go into straining the lattice, in which case $\Delta S$ 
may be allowed to take any value between zero and that given by 
equation (19). The experimental values suggest, however, that $\Delta S$ 
always takes its full value. 

This work on interstitial solutions represents a considerable advance 
in the subject of diffusion, being the first successful theoretical interpretation 
of observed values of $D_0$. Its success is due in no small 
measure to the recent availability of more accurate values of $D_0$, for 
previous attempts in this direction have been made by use of an 
equation very similar in form to number (15). The equation for $1/\tau$ 
was often derived simply by multiplying the probability that an atom 
had the activation energy necessary to carry it over the potential 
barrier, as given by the Boltzmann factor, by the number of times per 
second the atom was moving in the right direction to diffuse, i.e. $\nu$. 
Since the total energy of activation rather than the free energy appeared 
in these expressions, the equations resulting did not contain the term 
$\exp(\Delta S/R)$. But attempts were sometimes made to remove the discrepancy 
between experiment and theory which thereby resulted by 
taking explicitly into account the temperature dependence of the total 
energy of activation. When the energies involved vary linearly with 
temperature, such a procedure leads to much the same expressions 
for $D_0$ and $Q$ as the above and so with reliable data to work on, and 
perhaps in some cases a little more careful attention to geometric 
factors, could have succeeded in relating theory with experiment. 
However, it may be shown to be thermodynamically quite unsound 
to employ Boltzmann factors containing total energies (in the sense of 
the first law of thermodynamics) which are functions of temperature; 
the correct procedure in such cases is to replace the total energies 
normally appearing in the exponential by the corresponding free 
energies of the states to which they refer. Although the derivation of 
$1/\tau$ which we have discussed is the more rigorous and much to be 
preferred, the earlier simple kinetic derivations would have produced 
the same results, without recourse to thermodynamically unsound 
practices, had the correct form of the Boltzmann factor been employed. 

2. Diffusion in Substitutional Solid Solutions and Pure Metals 

It is by no means so obvious in this case by what atomic process diffusion 
occurs, for even on casual reflection several possibilities occur. It 
becomes necessary then to consider these possibilities in turn and to 
choose from them, as a model upon which to base discussion, the one, 
the consequences of which can be related the more closely to experimental 
measurements and observations on diffusion phenomena. 

(a) Comparison of Observed and Calculated Activation Energies—The 
three types of process which have been the subject of most discussion in 
the past are the direct exchange mechanism, the vacancy mechanism, 
and mechanisms involving atoms in interstitial positions. These were 
described and discussed in PMP.1, where it was pointed out that while 
the vacancy mechanism had often been preferred in the past because it 
was felt that it would require a much lower energy of activation than 
the other mechanisms, this feeling was only confirmed, at least for the 
close packed metal copper, by HUNTINGTON and SEITZ's direct 
quantum mechanical calculation of the activation energy $Q$ for all 
three processes. (In the particular type of interstitial mechanism chosen 
for consideration by these authors, the interstitial atoms, assumed 
present in very small concentration in thermodynamic equilibrium with 
the lattice, diffuse by moving on to an adjacent substitutional site, 
pushing the atom occupying it on to the next interstitial site. This has 
been termed “Interstitialcy Diffusion” by SEITZ to distinguish it from 
the more usual concept of interstitial diffusion, where the atoms move 
directly from one interstitial position to the next, and which would 
require a larger activation energy.) 

Their estimated values of $Q$ were, in kcal/mole, 400 for the 
exchange mechanism, 230 for the interstitialcy mechanism, and, with 
a more refined calculation, 64 for the vacancy mechanism. When 
these are compared with the experimentally observed value of ~ 50 
for self-diffusion in copper, it becomes clear that the vacancy mechanism 
is by far the most likely of the three. A recent re-estimation of the 
value for the exchange mechanism reduces it to about 240 and so does 
not vitiate this conclusion. The inference is usually drawn from these 
results that for all close packed lattices the vacancy mechanism 
represents most closely the actual process giving rise to diffusion in them. 
But the calculations allow of no similar conclusion regarding the 
mechanism in non-close packed lattices. In fact, since it is the ion core 
repulsions which play the dominant part in rejecting the possibility of 
the exchange and interstitialcy mechanisms in such a metal as copper, 
these mechanisms would be in much closer competition with the 
vacancy mechanism in calculations on less close packed metals, where 
the ion core repulsion is much reduced. 

For a long time serious consideration was given only to the three 
above-mentioned mechanisms, but recently other mechanisms have 
been suggested representing more generalized versions of these. Thus, 
for example, ZENER has pointed out that the simple exchange 
mechanism is only a special case of what he terms “ring diffusion” in 
which a number $n$ of atoms lying on a ring, so that each is a nearest 
neighbour of the one before and after it, diffuse simultaneously by 
synchronous movement round the ring. Fig. 10 illustrates this for 
$n = 4$ in an f.c.c. and a b.c.c. lattice. The exchange mechanism is clearly 
the simplest case of $n = 2$. ZENER has carried out detailed calculations, 
similar to those of HUNTINGTON and SEITZ, of the activation energy 
required for ring-diffusion processes in copper. He shows that contrary 
to what might at first sight be expected the activation energy does not 
increase with $n$, but is appreciably less for $n = 4$ than for other values. 
Although more atoms are involved in a 4-ring than in a simple exchange 
or 2-ring process, the disturbance attending it in the surrounding lattice, 
and hence the energy required for its performance, is less. In fact, the 
Coulomb interaction of the closed shells at the saddle point is almost 
negligible and it is only the exchange interaction which contributes 
significantly to the activation energy. ZENER finds this to be ~ 90 kcal, 
an estimate which, although very considerably nearer than those for 
the simple exchange and interstitialcy mechanisms to the observed 
value of ~ 50 kcal/mole, is still by no means in as close agreement with 
it as the estimate of 64 for the vacancy mechanism: and such refinements 
to the calculations which might be expected to bring it closer to 
the experimental value would, at the same time, tend further to improve 
the agreement with experiment of the estimate for vacancy diffusion. 
There would then appear to be no very strong grounds for replacing the 
vacancy by the ring mechanism as the most likely process of diffusion 
in close packed metals. As we shall see shortly there is in fact stronger 
evidence against the ring mechanism: but in the absence both of such 
evidence and similar calculations for less close packed metals ring diffusion 
must remain in their case a possible process to be considered. 

Fig. 10. 4-Ring Diffusion 
(a) f.c.c. lattice 
(b) b.c.c. lattice 

PANETH has calculated the activation energies for diffusion by the 
above-mentioned processes in the alkali metals and in this way has been 
led to consider yet another process of diffusion representing a generalization 
of the interstitialcy process. He found that while the energy to 
form an interstitial atom in sodium is very much less than for copper, 
the energetically most favourable position for the interstitial atom is 
actually midway between a pair of nearest neighbours, for an atom in 
such a position produces a very considerable amount of relaxation by 
the neighbours and by atoms lying along the line of the neighbours. 
A defect is thereby produced, termed a “crowdion”, which may be 
described as a row of $n$ nearest neighbour atoms compressed so as to 
occupy a distance normally occupied by $n - 1$ atoms, or as PANETH 
puts it, as an interstitial atom diluted over several atomic spacings. 
Diffusion then occurs by propagation of the region of compression along 
the direction of the crowdion and PANETH shows that a crowdion of 
about 8 atoms has a much lower energy of activation for diffusion than 
either the vacancy or ring mechanism. PANETH's estimate of c. 7 
kcal/mole for sodium awaits experimental confirmation. 

By way of a generalization of the simple vacancy model we may 
mention that SEITZ has discussed the possibility that diffusion may 
occur by the migration of coupled pairs of vacancies and suggests that 
this may be faster than diffusion by single vacancies because of the 
lower closed shell interaction during the exchange of an atom with 
either of the vacancy pair. Such double vacancies are thought to play 
a role in the changes in resistance of metals on annealing after plastic 
deformation, and may be of importance in chemical diffusion on account 
of the Kirkendall effect, as mentioned in Section I. 
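The weight of the energetic argument in this part of the section can be seen by converting the calculated activation energies quoted above for copper into relative Boltzmann factors. The sketch below does this at an assumed temperature of 1200 K and on the assumption, made only for illustration, that all mechanisms share the same pre-exponential factor; the dictionary keys are labels chosen for the example.

```python
import math

R_KCAL = 1.987e-3  # kcal/(mol K)

CALCULATED_Q = {   # kcal/mole, calculated values quoted in the text for copper
    "exchange": 400.0,
    "exchange (re-estimated)": 240.0,
    "interstitialcy": 230.0,
    "4-ring": 90.0,
    "vacancy": 64.0,
}
OBSERVED_Q = 50.0  # kcal/mole, observed value for self-diffusion in copper

def boltzmann_ratio(q_kcal, temp_K, reference=OBSERVED_Q):
    """exp(-(Q - Q_ref)/RT): rate relative to a process having the observed Q."""
    return math.exp(-(q_kcal - reference) / (R_KCAL * temp_K))

T = 1200.0  # assumed temperature, K
ratios = {name: boltzmann_ratio(q, T) for name, q in CALCULATED_Q.items()}
ranked = sorted(CALCULATED_Q, key=CALCULATED_Q.get)  # lowest Q first
```

On these assumptions the 4-ring process is slower than the vacancy process by several orders of magnitude, and the exchange and interstitialcy processes are slower still by many more.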

(b) Evidence from the Kirkendall Effect—A necessary and important 
property of the exchange or ring diffusion mechanisms is that they can 
never lead to any nett displacement of atoms relative to the crystal 
lattice or to any changes in volume or shape as a result of diffusion, 
other than those arising from differences in the atomic sizes of the 
interdiffusing species. The work of SMIGELSKAS and KIRKENDALL, 
reviewed in PMP.1, and many more recent experiments designed mostly 
to confirm and extend it and reviewed in Section I, have together 
demonstrated that all these forbidden phenomena are, in fact, associated 
with diffusion, at least in f.c.c. lattices, and so provide the most direct 
evidence to date against an exchange or ring mechanism in such 
structures. A Kirkendall effect has now been observed in about nine 
different systems and as these are all of the face centred cubic structure 
we may probably assume that a ring mechanism is most improbable in 
all alloys of this structure. No Kirkendall effect experiments have yet 
been performed which would assist in elucidating the mechanism of 
diffusion in alloys of other structures. As shown in Section I the 
vacancy mechanism does provide a satisfactory picture of all the 
phenomena associated with the Kirkendall effect, and although an 
interstitial mechanism might also be invoked to do the same, the 
calculations referred to in the last section, with the evidence mentioned 
here, provide very strong evidence for a vacancy mechanism. 

(c) Miscellaneous Evidence Relating to a Vacancy Mechanism—It 
has often been proposed that if diffusion occurs by vacancies then 
since the equilibrium concentration of vacancies increases rapidly with 
temperature it ought to be possible to quench a metal so as to produce a 
concentration of vacancies in excess of that corresponding to equilibrium 
at the temperature to which the material is quenched. But it appears 
from studies of the problem that the normal density of dislocations in a 
metal, which can act as sources and sinks of vacancies, is, at the 
temperatures at which conventional diffusion experiments are made, 
sufficient to restore equilibrium in a time very short compared with that 
required for the measurements. The excess vacancy concentration in 
such quenched materials will not then lead to an abnormally high 
diffusion coefficient if one attempts to measure this by conventional 
means. Thus quite recently HOFFMAN and TURNBULL found no 
difference between the self-diffusion coefficients of silver samples 
whether these were air quenched or slowly cooled before the experiments. 

However, the recently developed anelastic methods (see Section IV) 
which allow measurements of diffusion coefficients at much lower 
temperatures and in shorter times than by the conventional methods, 
have been used by NOWICK successfully to demonstrate that $D$ is, in 
fact, larger in a quenched alloy (of 29 atomic per cent Zn in Ag) than in 
the same alloy more slowly cooled. The method is such, of course, that 
any number of diffusion experiments may be made on the same sample 
and so one may follow the rate at which the increased diffusion coefficient 
falls to the normal value appropriate to slowly cooled or annealed 
alloys, as the vacancy concentration returns to its equilibrium value. 
In this way NOWICK estimates that $\Delta H_1$, the activation energy for the 
formation of vacancies, is ~ 12.5 kcal/mole and $\Delta H_2$, that for their 
exchange with atoms, ~ 20.4 kcal/mole, corresponding to the total 
activation energy $Q$ for diffusion of 32.9 kcal/mole. It is interesting in 
connexion with remarks in the next part of this section that $\Delta H_1/Q \approx 0.4$. 
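The arithmetic behind these figures, and the size of the vacancy excess that a quench might retain, can be sketched as follows. The two energies are those quoted above; the quenching and measuring temperatures are assumed purely for illustration, and the ratio of equilibrium concentrations is taken to depend only on the formation energy, i.e. the formation entropy is assumed independent of temperature.

```python
import math

R_KCAL = 1.987e-3  # kcal/(mol K)

dH_form = 12.5   # kcal/mole, formation of vacancies (quoted above)
dH_move = 20.4   # kcal/mole, exchange of a vacancy with an atom (quoted above)
Q_total = dH_form + dH_move
formation_fraction = dH_form / Q_total

def vacancy_supersaturation(t_quench_K, t_measure_K, dh_form=dH_form):
    """Ratio of equilibrium vacancy concentrations at the two temperatures."""
    return math.exp(-dh_form / R_KCAL * (1.0 / t_quench_K - 1.0 / t_measure_K))

# Assumed temperatures, chosen only for illustration
excess = vacancy_supersaturation(673.0, 323.0)
```

A large retained excess of this kind is what allows the enhanced diffusion coefficient to be detected before the vacancy concentration relaxes to its equilibrium value.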

Thus is confirmed a necessary consequence of the supposition that 
diffusion occurs by vacancies. 

There are many properties of metals which are intimately related in 
some way to their diffusion behaviour so that a study of such properties 
may be of use in furthering a knowledge of the diffusion processes 
themselves. By assuming that vacancies exist in the lattice and that 
diffusion occurs by the migration of vacancies, many of these properties 
have been successfully interpreted using the known values of diffusion 
coefficients, or conversely have provided diffusion data agreeing with 
that obtained by direct measurement. Such work then strengthens the 
evidence in favour of the vacancy mechanism in the f.c.c. structures for 
which these other properties have been successfully so interpreted. 
The work on sintering and on the creep of wires will be discussed in 
Section IV. These involve changes of shape at rates determined by 
diffusion processes and so like the Kirkendall effect rule out the 
possibility of an exchange or ring mechanism. It has been suggested 
that the increase in electrical resistance of metals after cold work is 
due to the generation of an increased vacancy concentration by the 
movement of the dislocations. MANINTVELD has recently studied the 
rate of return of the resistance to its normal value on annealing, a rate 
determined by the migration rate of the vacancies, and has calculated 
the activation energies for the movement of single vacancies in Au, 
Ag and Cu to be 16, 15 and 20 kcal/mole, respectively. Although 
appreciably less than half the corresponding total activation energies for 
diffusion, these are not unlikely values. The same experiments provide 
possible evidence for migration of pairs of vacancies, with activation 
energies rather less than those for single vacancies. 

There is also an increase in resistance of pure metals after neutron 
irradiation. Since such treatment can only produce vacancies or interstitials, 
it is most interesting that the activation energy for the recovery 
of the resistance on annealing turns out, at least for platinum, to be 
equal to that for recovery of the resistance produced by cold work in 
the same metal. 

It should be mentioned that such experiments as these constitute 
“evidence” for a vacancy mechanism only because the calculations of 
HUNTINGTON and SEITZ show an interstitial mechanism to be most 
unlikely on energetic grounds. In the absence of such calculations, 
Nowick’s experiments, for example, could as readily be explained on 
the supposition that diffusion occurs by any mechanism involving lattice 
defects whose concentration increases with temperature. 

Attempts have also been made to obtain evidence for a vacancy 
mechanism by studying diffusion in alloys such as FeAl, NiAl and 
CoAl which are known to possess a defect structure in that in certain 
ranges of composition a large number of lattice sites are vacant. Thus, 
for example, on the cobalt rich side of approximately 50-50 atomic 
per cent b.c.c. alloys of CoAl, the cobalt atoms replace aluminium, atom 
for atom, but on the aluminium rich side, although the b.c.c. structure is 
maintained, the number of atoms per unit cell is less than two and many 
sites are in consequence vacant. If such vacancies facilitate diffusion 
then a marked increase in the diffusion coefficient should occur as the 
concentration passes through 50 per cent towards higher aluminium 
content. Such an increase has in fact been observed by NIX and 
JAUMOT for the diffusion of cobalt in CoAl alloys. BURGESS and 
SMOLUCHOWSKI, however, found no similar increase in NiAl alloys and 
attributed this to the inability of the vacant sites in such alloys to play 
the role required of them in diffusion on account of their small size 
compared to that of the aluminium atom. 

One would expect a similar effect with no increased diffusion in the 
CoAl alloys and so the evidence from these experiments is not very 
helpful, but it is in any case difficult to see why such experiments could 
be taken as providing evidence for a vacancy mechanism except in 
these rather special defect alloys. Any increase in diffusion in the 
concentration range where the alloys are defective may certainly be due 
to a vacancy mechanism, but this throws no light on the mechanism 
in the non-defective structures, which, being body centred cubic, is 
much needed. 

(d) Theory of $D_0$—Diffusion coefficients for self-diffusion and for 
chemical diffusion in substitutional solid solution are well known to 
vary with temperature through the Arrhenius type equation (12). Just 
as in Section 2 (a) a comparison with experiment of the values of $Q$ to 
be expected from the various proposed models for diffusion has enabled 
a choice to be made between these models, so in general may we expect 
it to be possible to make or confirm the choice by comparison with 
experiment of observed values of $D_0$ for the various models. 

The most forbidding feature of the reported values of $D_0$ for chemical 
diffusion is that they cover the alarming range of 10-* to 10 cm²/sec. 
For self-diffusion $D_0$ lies, however, in the far more moderate range of 
$10^{-1}$ to 10 cm²/sec and so for this reason and because it is the simplest 
possible type of diffusion in metals, it is natural that attempts should 
first be made to understand the values of $D_0$ associated with it. 

A careful reassessment of the reported work on self-diffusion 
measurements has provided what are, on the evidence available, the 
most likely values of $D_0$ and $Q$ for the f.c.c. metals Au, Ag, Pb and Cu. 
An interpretation of these data by a method which is an extension of 
that used so successfully by WERT and ZENER for interstitial 
diffusion has shown that the values of $D_0$ for self-diffusion in these 
metals are consistent only with, and, therefore, confirm the operation 
of, the vacancy model of diffusion. The same methods applied to the 
b.c.c. metals suggest that self-diffusion occurs in them by a 4-ring 
diffusion process. 

(i) Self Diffusion—Equation (13) for D holds equally well for 
vacancy diffusion provided it can be assumed that successive jumps of 
any individual atom are random in direction and independent of one 
another. This assumption, valid certainly for dilute interstitial solid 
solution, is not precisely valid for vacancy diffusion for an atom which 
has just exchanged with a vacancy will be more likely to make its next 
move back to its original position rather than in any other direction. 
The error involved in making it (~ 10 per cent) is well within the 
error of most experiments and so at present unimportant. $\alpha$ in equation 
(13) is readily shown to be 1/12 for an f.c.c. and 1/8 for a b.c.c. lattice. 

If $\beta$ is the fractional concentration of vacancies, then in an f.c.c. 
lattice $12\beta$ is the probability that an atom has a vacant site as neighbour. 
Only then, of course, may the atom move. 

If $n'$ is the rate of jumping of an atom into an adjacent vacant site, 
then clearly 

$$1/\tau = 12\beta n' \qquad (20)$$

where $\tau$ is as previously defined (p. 295). Similarly for a b.c.c. lattice 

$$1/\tau = 8\beta n' \qquad (21)$$

If $\Delta G_1$ is the change in free energy on forming a vacancy at constant 
temperature and pressure, apart from that contributed by changes in 
mixing entropy, then the equilibrium concentration of vacancies is 
readily shown to be 

$$\beta = \exp(-\Delta G_1/RT) \qquad (22)$$

$n'$ may be evaluated by methods identical with those used by WERT 
for interstitial diffusion, when we find 

$$n' = \nu \exp(-\Delta G_2/RT) \qquad (23)$$

where $\nu$ is the frequency of vibration of an atom on a lattice site and 
$\Delta G_2$ is an amount of isothermal work defined exactly as on p. 296. 
Using the same thermodynamic methods and relations as for the 
interstitial case we find from equations (13) and (20) to (23) that for 
self diffusion in both b.c.c. and f.c.c. lattices, 

$$D = a^2\nu \exp[(\Delta S_1 + \Delta S_2)/R] \exp[-(\Delta H_1 + \Delta H_2)/RT] \qquad (24)$$

where $\Delta S_1$ and $\Delta H_1$ are the entropy and enthalpy associated with the 
free energy $\Delta G_1$, and similarly for $\Delta S_2$ and $\Delta H_2$. $(\Delta H_1 + \Delta H_2)$ corresponds 
to $Q$, the observed activation energy, and the constant term 
$D_0$ is given by 

$$D_0 = a^2\nu \exp[(\Delta S_1 + \Delta S_2)/R] \qquad (25)$$

As before, if the work $\Delta G_2$ appears as strain energy in the lattice $\Delta S_2$ 
will be positive. And also $\Delta S_1$ will be positive for when a vacancy is 
formed in the lattice the neighbouring atoms acquire a greater freedom 
of movement so that their entropy must increase. Thus the total entropy 
of activation $(\Delta S_1 + \Delta S_2) = \Delta S$ must be positive. ZENER points 
out that this is, in fact, true for self-diffusion in all cubic metals. 
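The step from equations (13) and (20) to (23) to the common result (24) rests on the product of $\alpha$ and the number of neighbouring sites being unity in both lattices. The sketch below checks this and then evaluates (24) for one set of inputs. Those inputs are assumed values of a plausible order of magnitude, not data from the text, and the function names are hypothetical.

```python
import math
from fractions import Fraction

R_CAL = 1.987  # cal/(mol K)

def geometric_product(alpha, neighbours):
    """alpha times the number of neighbouring sites, as combined in (13) with (20) or (21)."""
    return alpha * neighbours

def self_diffusion_coefficient(a_cm, nu, dS1, dS2, dH1, dH2, temp_K):
    """Equation (24): D = a^2 nu exp[(dS1+dS2)/R] exp[-(dH1+dH2)/RT]."""
    d0 = a_cm ** 2 * nu * math.exp((dS1 + dS2) / R_CAL)
    return d0 * math.exp(-(dH1 + dH2) / (R_CAL * temp_K))

fcc = geometric_product(Fraction(1, 12), 12)
bcc = geometric_product(Fraction(1, 8), 8)

# Assumed inputs (cm, 1/sec, cal/(mol K), cal/mole, K), for illustration only
D_example = self_diffusion_coefficient(3.6e-8, 7.0e12, 2.0, 3.0, 17000.0, 30000.0, 1200.0)
```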

As we have seen, the calculations of HUNTINGTON and SEITZ provide 
values of $\Delta H_1$ and $\Delta H_2$ agreeing quite closely with experiment, but no 
such corresponding calculations, necessary to complete the theoretical 
interpretation of $D_0$, have yet been made for $\Delta S_1 = -\partial\Delta G_1/\partial T$ and 
$\Delta S_2 = -\partial\Delta G_2/\partial T$. Very rough estimates have from time to time been 
made (see PMP.1) and do produce values of $D_0$ in order of magnitude 
agreement with experiment. An exact calculation of $\Delta S_1$ and $\Delta S_2$ 
would be very difficult, but we may, as in the interstitial case, at least 
obtain an expression for $\Delta S$ in terms of $\Delta H$, although the analysis is 
more complicated here by there being two terms in the activation 
energy. However, we proceed in a similar way and propose a correlation 
between both $\Delta G_1$ and $\Delta G_2$ and other physical properties of the lattice 
to which we might expect these quantities to be most closely related. 
Thus we write again equation (17) for $\Delta G_2$. For geometrically similar 
lattices $e$ will be the same and we may suppose $v$ to be proportional to 
the atomic volume $M/\rho$, where $M$ is the atomic weight and $\rho$ the 
density, so that we may write 

$$\Delta G_2 = k_2 M\mu/\rho \qquad (26)$$

where $k_2$ is a constant for metals of one structure. $\Delta H_1$ is the change in 
internal energy on forming a vacancy and it has often been proposed 
that this will be closely related to the latent heat of sublimation $L_s$, so 
we put 

$$\Delta H_1 = k_1 L_s \qquad (27)$$

Assuming $\mu$ and $\rho$ to vary linearly with temperature we then find, 
using the equation $\Delta S = -\partial\Delta G/\partial T$, that 

$$Q = k_1 L_s + k_2 M\mu_0/\rho_0 \qquad (28)$$

$$\Delta S = \Delta S_1 - k_2 (M\mu_0/\rho_0)\{(\mu/\mu_0)' - (\rho/\rho_0)'\} \qquad (29)$$

where the subscript 0 refers to the constant term in the linear expressions 
for $\mu$ and $\rho$ as in equation (18) for $\mu$, and the dash denotes differentiation 
with respect to temperature. The second term in (29) is negative, 
so that $\Delta S$ remains positive. 
We may eliminate $k_2$ from (29) using (28) and so find for $\Delta S$ 

$$\Delta S = \Delta S_1 - (Q - k_1 L_s)\{(\mu/\mu_0)' - (\rho/\rho_0)'\} \qquad (30)$$


The experimental value of $\Delta S$ is given by $R \log(D_0/a^2\nu)$ so $k_1$ may be 
calculated for each of the cubic metals for which $D_0$ and $Q$ are known 
by inserting in equation (30) appropriate values for the other quantities. 
$\Delta S_1$, the change in entropy on forming a vacancy, arises principally from 
the lowering of the vibrational frequency of the atoms surrounding 
the vacant site, but a proper calculation of this frequency change does 
not appear to have been made. It is to be noted, however, that the local 
values near a vacant site, certainly of $\mu$ and most probably of $(\mu/\mu_0)'$, 
will be less than the bulk values, and these changes in elastic properties 
and in $\nu$ will be closely related. It is the local values of $(\mu/\mu_0)'$ which 
should of course be used in (30). But since $k_1$ derived from equation (30) 
decreases with decreasing $\Delta S_1$ and increases with increasing $(\mu/\mu_0)'$ we 
shall assume for simplicity that $k_1$ is unaffected if we put $\Delta S_1 = 0$ in 
equation (30) and employ in the calculations the bulk value of $(\mu/\mu_0)'$. 
For $\nu$ we may take the Debye frequency. Using for $(\mu/\mu_0)'$ the same 
source of data as employed in Part 1 of this section, it is found 
that $k_1$ is very nearly the same for the four f.c.c. metals for which 
all the relevant data exist, as shown in Table 2. 
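The procedure just described, namely solving equation (30) for $k_1$ with $\Delta S_1 = 0$, can be sketched as follows. The numerical inputs are assumed values of a plausible order of magnitude; they are not the data from which Table 2 was compiled, and the function names are hypothetical. The symbol `x` stands for $(\mu/\mu_0)' - (\rho/\rho_0)'$.

```python
import math

R_CAL = 1.987  # cal/(mol K)

def experimental_entropy(d0, a_cm, nu):
    """dS = R log(D0 / (a^2 nu))."""
    return R_CAL * math.log(d0 / (a_cm ** 2 * nu))

def solve_k1(q_cal, l_s_cal, delta_s, x, delta_s1=0.0):
    """Solve equation (30) for k1, where x = (mu/mu0)' - (rho/rho0)'."""
    return (q_cal + (delta_s - delta_s1) / x) / l_s_cal

def entropy_from_k1(q_cal, l_s_cal, k1, x, delta_s1=0.0):
    """Right-hand side of equation (30), used here as a consistency check."""
    return delta_s1 - (q_cal - k1 * l_s_cal) * x

# Assumed inputs, for illustration only
Q = 46500.0      # cal/mole
L_s = 81000.0    # cal/mole
x = -2.0e-4      # per K
dS = experimental_entropy(0.2, 3.61e-8, 7.0e12)
k1 = solve_k1(Q, L_s, dS, x)
dS_check = entropy_from_k1(Q, L_s, k1, x)
```

Because `x` is negative, a larger experimental $\Delta S$ lowers the value of $k_1$ obtained, which shows how sensitive the result is to the measured $D_0$.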

TABLE 2 

0.178 

$Q$ kcal/mole   27.5   45.9   53.0   46.5   59.7   142 
$k_1$   0.21   0.21   0.20   0.23   Mean $k_1$ = 0.21   −0.075   −0.147 
$k_2$   0.23   0.22   0.25   0.18   Mean $k_2$ = 0.22   - 

The constancy of $k_1$ for the f.c.c. metals justifies the correlations 
(26) and (27) and the assumption of a vacancy mechanism on which 
they were based. The mean value of $k_1$ gives for Cu, for example, 
17.4 kcal/mole as the energy required to form a vacancy. This, being 
about 0.4 of the total activation energy, is a very reasonable value, 
although a little lower than the value of $0.64Q$ estimated by HUNTINGTON 
and SEITZ. But since both this and the present estimate are 
only approximate, the discrepancy does not invalidate the present 
analysis. 

However, we see from Table 2 that the values of k₁ for the b.c.c.
metals are not only well removed from those for f.c.c. but are in addition
negative; a negative k₁ in equation (27) is meaningless. This may
indicate inaccuracies in the measurements for W (see also p. 295) and
α-Fe or that in their case a vacancy mechanism of diffusion does not
operate. It is very interesting that the observed values of D₀ and Q for
these two metals can be satisfactorily related only on the assumption
that diffusion occurs in them by a 4-ring exchange mechanism.

The diffusion coefficient for self-diffusion by a ring mechanism may
be shown to be given by

$$D = \gamma \nu a^2 \exp(-\Delta F/RT)$$

where γ is an integer always greater than 1 and depending on the
number of the ring and upon the type of lattice. ΔF is the free energy
of activation for the ring diffusion process and, as before, we may put

$$\Delta F = kM\mu/\rho$$

and so find

$$\Delta S_{\mathrm{ring}} = Q\{(\mu/\mu_0)' - (\rho/\rho_0)'\} = R \log(D_0/\gamma a^2 \nu)$$

Thus γ may be calculated from the equation

$$R \log \gamma = R \log(D_0/a^2\nu) - Q\{(\mu/\mu_0)' - (\rho/\rho_0)'\} \qquad (32)$$


and the values turn out, for f.c.c. lattices, to be not only far from
constant but all small fractions inconsistent with the supposition of a
ring mechanism. But for W and α-Fe they are greater than 1 and both
closer to the value of γ (= 6) for a 4-ring diffusion process than for any
other numbered ring.
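Equation (32) can be evaluated directly once D₀, a, ν, Q and the temperature coefficients are known. The sketch below assumes natural logarithms and energies in kcal/mole; the function name and the numbers are hypothetical and serve only to show that the relation is self-consistent.

```python
import math

R_KCAL = 1.987e-3  # gas constant in kcal/(mole K)


def ring_gamma(d0, a, nu, q, x, r=R_KCAL):
    """Return gamma from R log(gamma) = R log(D0 / a^2 nu) - Q * X.

    x stands for the bracket {(mu/mu0)' - (rho/rho0)'} in equation (32).
    """
    r_log_gamma = r * math.log(d0 / (a * a * nu)) - q * x
    return math.exp(r_log_gamma / r)


# Illustrative check: build a D0 that corresponds to gamma = 6 and recover it.
q_demo, x_demo = 60.0, 1.0e-4            # hypothetical values
d0_demo = 6.0 * math.exp(q_demo * x_demo / R_KCAL)  # with a = nu = 1 for simplicity
gamma_demo = ring_gamma(d0_demo, 1.0, 1.0, q_demo, x_demo)
```

A value of γ well below 1 from real data would be the "small fraction" the text describes as inconsistent with a ring mechanism.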

An analysis similar to the above, with the same type of assumptions,
has been carried out for interstitial diffusion but fails to provide values
of ΔS and, therefore, D₀, which are consistent with experiment either
for b.c.c. or f.c.c. metals.

The methods outlined here, which avoid any direct calculation of D₀,
show that observed values of it may be satisfactorily interpreted on the 
basis only of a vacancy mechanism for f.c.c. metals and of a 4-ring 
mechanism for b.c.c. metals. Both results are quite consistent with 
previous evidence on the mechanism of diffusion in these structures. 

Equation (30) may be written in a simpler form by noting, as was
pointed out by STEIGMAN, SHOCKLEY and NIX some time ago (see
also ref. 31), that the ratio Q/Lₛ is fairly constant from metal to metal
and of mean value about 0.6. Putting Lₛ = Q/0.6 in equation (30),
with ΔSᵥ = 0 and the mean value k₁ = 0.21, we find

$$\Delta S = 0.65\,Q\{(\mu/\mu_0)' - (\rho/\rho_0)'\}$$
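The numerical factor 0.65 follows from simple arithmetic, sketched below under the assumption that equation (30) has the form ΔS − ΔSᵥ = (Q − k₁Lₛ){...} with ΔSᵥ = 0, so that the coefficient of Q is 1 − k₁/(Q/Lₛ). The function name is hypothetical.

```python
def entropy_factor(k1, q_over_ls=0.6):
    """Coefficient of Q in dS = (Q - k1 * L_s) * X when L_s = Q / q_over_ls."""
    return 1.0 - k1 / q_over_ls


factor = entropy_factor(0.21)  # mean k1 for the f.c.c. metals, Q/L_s about 0.6
```

Because k₁ enters only through the ratio k₁/(Q/Lₛ), the factor is not very sensitive to small changes in either quantity.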


If further we ignore (ρ/ρ₀)′ in the above analysis, as being smaller
than (μ/μ₀)′, and recalculate a mean value of k₁ (the separate values of
k₁ turn out to be not quite so constant as in Table 2), we find that

$$\Delta S = 0.55\,Q(\mu/\mu_0)' \qquad (33)$$

which is exactly the result obtained by ZENER, who plotted log (D₀/a²ν)
against Q(μ/μ₀)′/R and found the points for f.c.c. metals to be fairly
close to a line of slope 0.55. ZENER, however, provides no clear
understanding of the significance of the numerical factor. As would be
expected from the foregoing, the points for b.c.c. metals lie well above
the line of slope 0.55 and ZENER interpreted this as probable evidence
for a ring diffusion process.

With the now known value of k₁, k₂ can be calculated from equation
(29) or preferably equation (28). An examination of the displacements
of its neighbours during the diffusion of an atom shows that these
correspond crudely, when resolved in two directions at right angles, to
atomic movements associated with shears for which C₄₄ and (C₁₁ − C₁₂)/2
are the appropriate moduli. Using the sum of these as a rough estimate
of the appropriate modulus in equation (28), k₂ is found to be fairly
constant, as shown in Table 2. Ideally, of course, the temperature
coefficient of (C₄₄ + (C₁₁ − C₁₂)/2) should be used in equation (30) in
determining k₁, but the necessary measurements do not appear to have
been made. We might expect the value of (μ/μ₀)′ to be less sensitive
to the type of modulus chosen than is μ itself, so that the calculation of
k₁ may not be seriously affected by the modulus we employed not being
the most appropriate. k₂, which requires a knowledge of μ and (μ/μ₀)′,
may be more sensitive to choice of correct modulus and a more careful
study of the problem may reveal a modulus which gives more constant
values of k₂ than does the one we have used.

Equations (28) and (29) provide a useful working rule for estimating 
those self diffusion coefficients of metals which have not yet been 
directly measured. 

(ii) Chemical Diffusion—It might be assumed that equation (24) 
applies equally well, just as it stands, to diffusion of the solute in a 
dilute solid solution, where the mutual interaction of solute atoms may 
be neglected. However, equation (24) is based on the assumption that 
successive moves of a diffusing atom are independent of one another 
and for chemical diffusion such independence does not necessarily hold. 
It does not hold, for example, when diffusion occurs by migration of
solute atom-vacancy molecules, as JOHNSON suggested would take
place if vacancies were more readily formed near to solute atoms than
elsewhere. However, ZENER has analysed this case and concludes that
equation (24), unchanged, is still applicable. But a more extensive
investigation of other possible situations would be very welcome.
For example, if vacancies are formed equally readily on all sites, so as
to be distributed at random in the lattice, but a vacancy exchanges
much more readily with a solute atom than with a solvent atom,
then when a vacancy meets a solute atom it will exchange with
it a large number of times before moving away again: successive moves
of a solute atom are not then independent of one another and an analysis
of this situation suggests that the equation should be multiplied by
some factor of the order of magnitude of the co-ordination number.

However, if we calculate values of ΔS, using the equation ΔS
= R log (D₀/a²ν), from the extensive available data on diffusion of
solute atoms in dilute solid solution, we find the great majority of them
are negative and often numerically very large. It seems unlikely that
refinements to equation (24) would appreciably change this finding and,
in fact, the one we have mentioned would produce even more negative
values of ΔS. Now we have seen that negative values of ΔS are
inadmissible, so it appears that very many reported values of D₀ for
chemical diffusion in dilute solid solutions are smaller than can be
accounted for by an extension of the theory which so successfully
accounts for values of D₀ in self diffusion.

It is important to note that, very often, the lower the experimental
values of D₀ for the diffusion of a series of solutes in the same solvent
the lower also is the value of Q, relative to the self-diffusion activation
energy of the pure solvent. If the unaccountably low values of D₀ are
low because of experimental errors in the measurements, this correlation
with Q values suggests that such errors may arise from the use of
polycrystalline material in the measurements. Grain boundary diffusion
processes contribute appreciably to the low temperature measurements
of D and so raise them above the true lattice diffusion values. This will
lead to abnormally low values of Q and at the same time of D₀, as
discussed more fully on page 286, and these errors will be related to
one another because both arise from the same cause. We may obtain a
rough estimate of this relation as follows. Values of D are most likely
to represent true lattice diffusion at high temperatures and certainly
near the melting point Tₘ. We may, therefore, assume that the
straight line representing log D for true lattice diffusion plotted against
1/T may be obtained from the "best straight line" drawn through
experimentally obtained high and low temperature values of D by a
clockwise rotation about the point where it passes through Tₘ. It is
then readily shown that log D₀ and Q derived from the experimental
straight line will differ from the true lattice diffusion values by amounts
Δ log D₀ and ΔQ, which are related by

$$\Delta \log(D_0/a^2\nu) = \Delta Q/RT_m \qquad (34)$$
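Relation (34) follows from requiring the true and the apparent Arrhenius lines to give the same D at Tₘ. A minimal sketch of this, assuming natural logarithms, Q in cal/mole and hypothetical numbers and names, is:

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def apparent_d0(d0_true, q_true, q_apparent, t_m, r=R_CAL):
    """D0 of a line with slope q_apparent that passes through the true D at T_m.

    Both lines obey log D = log D0 - Q / (R T); rotating about T_m keeps D(T_m) fixed.
    """
    log_d_at_tm = math.log(d0_true) - q_true / (r * t_m)
    return math.exp(log_d_at_tm + q_apparent / (r * t_m))


# Hypothetical case: true Q = 50,000 cal/mole, apparent Q lowered to 40,000.
t_m = 1356.0
d0_app = apparent_d0(1.0, 50000.0, 40000.0, t_m)
delta_log_d0 = math.log(d0_app / 1.0)
delta_q = 40000.0 - 50000.0
```

Here `delta_log_d0` equals `delta_q / (R_CAL * t_m)`, so an apparent Q that is too low by 10 kcal/mole lowers log D₀ by about 3.7 in this example. The common factor a²ν cancels in the difference, which is why it does not appear in the sketch.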


Relation (34) is most interesting in connexion with a correlation
observed by DIENES between D₀ and Q. DIENES showed that if
experimental values of log (D₀/a²ν) for both self diffusion and chemical
diffusion in dilute solutions are plotted against Q/RTₘ, the points all
lie, although roughly, along a line of slope one and so of equation

$$\log(D_0/a^2\nu) = Q/RT_m - \text{constant} \qquad (35)$$

It can be shown that, because of a loose correlation between (μ/μ₀)′
and 1/Tₘ, this equation may be derived with the same value of the
constant as a very rough approximation to our previous expression for
log (D₀/a²ν): but our theory does not allow the negative values of
R log (D₀/a²ν) = ΔS, which this equation claims to cover. It is clear
then why the experimental values of D₀ and Q for self-diffusion lie
near the line of equation (35). The very extensive range of values of
D₀/a²ν, covering about nine cycles of ten, which are included in the
correlation, tends to mask the fact that many points lie away from the
line by amounts which correspond to errors of a factor as great as 10 in
the values of D₀/a²ν. If then we take equation (24) as valid for chemical
diffusion in dilute solid solution and assume that, say, equation (33)
even approximately represents the value of ΔS for chemical diffusion,
then since ΔS = R log (D₀/a²ν) is proportional to Q and since further
(μ/μ₀)′ for the alloy may not differ much from that for the pure solvent,
we might expect the true values of D₀ to give points fairly close to
DIENES' line of slope 1. But even if the experimental values of D₀ and Q
are in error, for reasons such as mentioned above, so that these errors
are related by equation (34), then by virtue of this equation they will
continue to lie near the line of slope 1 when plotted as log (D₀/a²ν)
against Q/RTₘ, but in a region removed from that corresponding to the
true values of D₀ and Q by an amount which is larger the larger the
errors in D₀ and Q.

The theory of chemical diffusion even in dilute solid solution is by 
no means complete and the above discussion of it in consequence very 
approximate, but it does appear from this work that many of the
reported values of D₀ and Q are not truly representative of lattice
diffusion. That D₀ and Q are roughly correlated in the way shown by
DIENES does not necessarily mean that all values of D₀ and Q are at
least approximately correct and that the correlation is a true
consequence of lattice diffusion, for it is as readily explained if the departures
of D₀ and Q from the values for true lattice diffusion are related in a
way approximately represented by equation (34). One such way, as we
have seen, is through the operation of grain boundary diffusion processes,
but any effects which produce enhanced overall diffusion at low
temperatures, by providing what ZENER calls "short circuiting paths,"
with a lesser tendency to do so at high temperatures, will have a similar
influence on reported values of D₀ and Q. Many of the complicating
features associated with the Kirkendall effect are just such as might 
be more important at low temperatures. Thus for example the lattice 
distortion accompanying the shrinkage and expansion which occurs in 
chemical diffusion and which will favour increased diffusion rates, will 
persist more readily at low than at high temperatures. Both Kirkendall 
effects, through the influence of these features, and grain boundary effects 
in polycrystalline material occurring even in the absence of a Kirkendall 
effect, may lead to values of Q and D₀ which do not represent true
lattice diffusion. The latter may be avoided by using single crystal 
material or, if polycrystalline, by working over a temperature range 
such that for the grain sizes used, grain boundary diffusion is negligible, 
but the Kirkendall effects are to be avoided only by working with very 
low concentration gradients, or with zero gradients as in chemical self- 
diffusion studies. 

Evidence that these considerations may be quite important is
provided by recent measurements at extremely low concentrations of
Au (0.001 per cent) in Cu and of Co in Ni, for these both yield values
of D₀ which correspond to positive values of ΔS, even agreeing roughly
with the theoretical values given by equation (33). This is very far from
true for the many dilute solution values of D₀ for other elements
diffusing in Cu and which have been measured with larger concentration
gradients. The few results of chemical self-diffusion coefficient
measurements and of measurements of diffusion coefficients by the anelastic
methods discussed in Section IV, both of which employ chemically
homogeneous materials, also lead to positive values of ΔS. It is also of
great interest that Q for Au in Cu is quite close to that for self-diffusion
in Cu, in contradistinction to a general result that Q for low concentration
solute diffusion in a solvent is always appreciably less than that for
self-diffusion in the pure solvent (see PMP.1). This one result throws
doubt on this commonly accepted generalization which was the basis of
JOHNSON's solute atom-vacancy "molecule" diffusion mechanism.
Further investigation may possibly show it to have been a consequence 
of Kirkendall or concentration gradient effects, absent in self-diffusion, 
introducing errors into chemical diffusion measurements even at quite 
small concentrations of a few per cent. 

Many of these matters have been discussed at length by NOWICK,
who also makes use of the foregoing theory of D₀ to calculate lattice
values for D₀ and Q from those reported measurements of D in various
low concentration binary systems which were made at the highest
temperature used in the investigations, and so are most likely to
represent true lattice diffusion behaviour. Using ZENER's approximate
equation (33) we have for D

$$D = \nu a^2 \exp(0.55\,Q(\mu/\mu_0)'/R)\,\exp(-Q/RT) \qquad (36)$$

Hence a single high temperature value of D serves to provide a value
for Q which, as NOWICK points out, even if equation (36) is correct only
to an order of magnitude, will be more accurate than that obtained from a
conventional plot of log D against 1/T, for such plots often yield values of D₀
which appear to be in error by factors as great as 10°. In this way,
using values of (μ/μ₀)′ for the pure solvent, he finds that the recalculated
activation energies for the diffusion of a series of solutes in the same
solvent are, in fact, all much closer (within ~ 15 per cent) to that for
self-diffusion in the pure solvent than the reported values. Also the
solute activation energies appear often to be greater than the solvent
self-diffusion activation energies for those solutes which according to
KÖSTER and RAUSCHER increase the elastic modulus of the pure
solvent and to be less for those solutes which decrease the modulus, as
might be expected from equation (28). From these values of Q NOWICK
also calculated the corresponding values of D₀ from equation (33) but
these are much more sensitive to any inaccuracies in equation (33) than
are the values of Q to inaccuracies in equation (36), as the putting of
appropriate quantities into this equation when solved for Q will show.
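Solving equation (36) for Q, as the last sentence suggests, gives Q = R log(νa²/D) / (1/T − 0.55(μ/μ₀)′). The sketch below assumes natural logarithms, the sign convention in which the exponent 0.55Q(μ/μ₀)′/R is positive, and hypothetical input values; the function names are illustrative only.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def d_from_eq36(q, t, a, nu, x, factor=0.55, r=R_CAL):
    """D = nu a^2 exp(factor * Q * x / R) exp(-Q / (R T)), with x = (mu/mu0)'."""
    return nu * a * a * math.exp(factor * q * x / r) * math.exp(-q / (r * t))


def q_from_single_d(d, t, a, nu, x, factor=0.55, r=R_CAL):
    """Invert equation (36) for Q from one measured D at temperature t."""
    return r * math.log(nu * a * a / d) / (1.0 / t - factor * x)


# Hypothetical inputs: a in cm, nu in 1/sec, x in 1/K, T in K.
a_cm, nu_hz, x_coeff, temp = 2.5e-8, 1.0e13, 3.0e-4, 1200.0
d_demo = d_from_eq36(45000.0, temp, a_cm, nu_hz, x_coeff)
q_back = q_from_single_d(d_demo, temp, a_cm, nu_hz, x_coeff)

# A tenfold error in D changes Q by only R ln(10) / (1/T - 0.55 x).
q_shift = q_from_single_d(d_demo * 10.0, temp, a_cm, nu_hz, x_coeff) - q_back
```

In this example a factor of 10 error in D shifts Q by roughly 7 kcal/mole out of 45, which illustrates why Q obtained this way is comparatively insensitive to errors in D or in equation (36).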

Little has been said about the applicability to chemical diffusion of
the theoretical expressions for ΔS we have discussed for self diffusion.
While equation (30) may be formally applicable it is, particularly in the
absence of a clearer knowledge of the mechanism of chemical diffusion,
more difficult to know what values to insert for the various quantities
it contains. Such matters, and to what extent equation (33) is still valid
as an approximation to (30), remain for further study, but it is to be
noted that when experimental measurements lead to positive values of
ΔS, these agree, at least within an order of magnitude, with the values
given by equation (33).


SECTION IV

EXPERIMENTAL METHODS









The various conventional experimental methods by which chemical
and self-diffusion coefficients in the various metals and alloys have been
measured have been adequately reviewed elsewhere, together
with the mathematical equipment required for their interpretation.
These generally require the measurement of the rate at which an
inhomogeneity in chemical composition, or in content of radioactive
atoms, is changed by the process of diffusion and a compilation of the
data accumulated by these methods has recently been published in
SMITHELLS Metals Reference Book. It is to be emphasised, however,
that much of this is probably of uncertain theoretical significance for
many reasons, discussed either in PMP.1 or in the present chapter.






That diffusion does take place can only mean that single atoms can
migrate from one site in the lattice to another and in theoretical
interpretations of diffusion coefficients the rate of this migration is regarded
as the fundamental quantity. But any property of a metal which can
be considered to depend upon the migration of atoms may in principle
be employed to measure the rate of migration and so to provide
information on diffusion.
Thus, for example, ALEXANDER, DAWSON and KLING have studied
the variation with temperature of the creep rate of gold wires under load
and, assuming this to occur by the vacancy mechanism of creep suggested
by NABARRO, have calculated from the results a value of 48 kcal/mole
for the activation energy for gold self-diffusion. Using a more refined
theory due to HERRING they obtain a value of 50 kcal/mole, agreeing
better with the directly measured value of 53.






But two properties in particular have been made use of recently to
determine diffusion coefficients and these provide several advantages
over the conventional methods of measurement. These are certain
types of internal friction in metals and their ability to sinter.


1. Sintering Method for Measuring Self-diffusion Coefficients
KUCZYNSKI and others have studied the rate of sintering of
spherical particles to a flat plate of the same material or of a fine wire
to the surface of a cylinder on which it is wound, when the two become
bonded together in a way shown schematically in section in Fig. 11.
Various processes may be considered as responsible for the obvious
transfer of material that takes place. If this is to be by volume diffusion
then KUCZYNSKI supposes that, just as the equilibrium vapour pressure
over a convex surface of a liquid is greater than that over a plane
surface, so, assuming a vacancy mechanism of diffusion, is the
equilibrium vacancy concentration over the highly convex surface just
inside the neck or bond in Fig. 11 greater than that prevailing
elsewhere. This difference in concentration is such as to cause vacancies
to flow away from the neck and, therefore, atoms into it, enabling the
bond to grow in width.

Fig. 11. Section of Wire or Spherical Particle sintered to flat surface (Schematic)

With such a model for the growth of the bond between particle
or wire and plate KUCZYNSKI obtains expressions from which the
self-diffusion coefficient can be calculated. For a particle bonding to a plate

$$D = \frac{kT x^5}{40\,\gamma\,\delta^3\,t\,a^2} \qquad (37)$$

and for a wire bonding to a plate or cylinder of large radius

$$D = \frac{3kT x^5}{80\pi\,\gamma\,\delta^3\,t\,a^2} \qquad (38)$$

where k is Boltzmann's constant, T the absolute temperature of anneal,
γ the surface energy of the metal, δ the interatomic distance, and a the
radius of the particle or wire. The experiments require a measurement
of the width 2x of the bond formed after a time of anneal t. This is
done by microscopic measurements on samples suitably mounted and
sectioned. The necessary dependence of x⁵ on t demanded by equations
(37) and (38) has been confirmed, at least for not too small sized particles
(15 μ), by experiments on silver and on copper, and the
measurements used to calculate the lattice self-diffusion coefficients for
these two metals.
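The use of equations (37) and (38) can be sketched as below. The sketch assumes the equations are read as D = kTx⁵/(40γδ³ta²) for a sphere on a plate and D = 3kTx⁵/(80πγδ³ta²) for a wire, with all quantities in c.g.s. units; the function names and the sample numbers are hypothetical, not measured values.

```python
import math

K_B = 1.380649e-16  # Boltzmann's constant in erg/K (c.g.s.)


def d_sphere_on_plate(x, t, a, temp, gamma, delta):
    """Equation (37) as read here: D = k T x^5 / (40 gamma delta^3 t a^2)."""
    return K_B * temp * x**5 / (40.0 * gamma * delta**3 * t * a**2)


def d_wire_on_plate(x, t, a, temp, gamma, delta):
    """Equation (38) as read here: D = 3 k T x^5 / (80 pi gamma delta^3 t a^2)."""
    return 3.0 * K_B * temp * x**5 / (80.0 * math.pi * gamma * delta**3 * t * a**2)


# Hypothetical inputs: neck half-width x and radius a in cm, t in sec,
# temp in K, gamma in erg/cm^2, delta in cm.
args = dict(t=3600.0, a=5.0e-3, temp=1100.0, gamma=1200.0, delta=2.9e-8)
d_small = d_sphere_on_plate(x=4.0e-4, **args)
d_large = d_sphere_on_plate(x=8.0e-4, **args)
```

Because D varies as x⁵, doubling the measured half-width multiplies the derived D by 32, which is why an accurate value of 2x matters so much in this method.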

Since x⁵ is proportional to t it is important to obtain an accurate
value for 2x. But the present methods of experiment seem to give a
considerable spread of values for 2x from particle to particle or in
different parts of the wire when this technique is used, and a mean value
must be taken for a large number of observations. This must detract
from the accuracy of the measurements, and for this reason, because of
the approximate nature of the derivation of equations (37) and (38)
and because of uncertainties in the value of γ, it is surprising that the
results for silver should agree so closely with those measured by
conventional radioactive methods. Thus KUCZYNSKI finds for silver, using the
wire and cylinder method, which is reported as being the most convenient
and accurate, D = 0.9 exp (−45,700/RT) compared with JOHNSON's
results, recently well confirmed by HOFFMAN and TURNBULL, of
D = 0.89 exp (−45,900/RT).

The results for copper are in rather less good agreement but still 
fairly close to those obtained by conventional radioactive methods. 

KUCZYNSKI considered that the transfer of matter was definitely by
volume diffusion, for his examination of the other possible processes of
viscous or plastic flow, of evaporation and condensation and of surface
diffusion suggested that these require the relation between x and t to
be, respectively, x² ~ t, x³ ~ t, and x⁷ ~ t, whereas the experiments
clearly produced a fifth power law consistent only with the supposition
of volume diffusion. However, both CABRERA and SCHWED have
criticized KUCZYNSKI's treatment of surface diffusion and pointed out
that an x⁵ ~ t law should hold for both surface and volume diffusion:
SCHWED further suggests that for very small particles the law should
become z* ~ t. The experiments on silver and copper seem though to 


support the view that sintering is in fact by volume diffusion and not 
surface diffusion, at least for the particle and wire sizes used, particularly 


because the activation energies are very close for silver and for copper to 
those found for volume diffusion by more direct methods. 

The method is one of promise and seems capable of providing quite
accurate values for D, although it has not yet been fully developed
experimentally: another way of measuring the width of the bond which
has been suggested is to follow the change in contact resistance between
two metallic spheres or hemispheres as these sinter together. It would
benefit from a more thorough theoretical treatment of the sintering
process. It has the great advantage over conventional methods that
it could be used to measure the self-diffusion coefficients of metals for
which no suitable radioactive tracers exist, such as aluminium and
magnesium.


2. Anelastic Methods 
In both interstitial solid solutions in b.c.c. metals and substitutional
solid solutions in f.c.c. metals there is a type of anelasticity which
has been interpreted as arising from a redistribution of solute atoms in
the presence of an applied stress, and which is, therefore, intimately
related to the diffusion properties of the solute atoms. Measurements on
this type of anelasticity have recently been employed to measure the
diffusion coefficients of solute atoms in both these types of solid solution.

A solute atom in an octahedral interstitial position will produce a
distortion in the surrounding lattice with tetragonal symmetry, the
tetragonal axis being along one of the three ⟨100⟩ directions. In
the absence of any applied stress the tetragonal axes of distortion will
be distributed equally among these three ⟨100⟩ directions, but if a
tensile stress is applied along one of these directions atoms will tend to
move into those positions for which the tetragonal axis is along the same
direction, for the stress is then to some extent relieved. A certain time
will be required for the establishment of the new stress induced
preferential distribution of atoms so that the strain will lag behind the stress.
This hysteresis will give rise to relaxation and other anelastic effects
such as internal friction.

In substitutional solid solution isolated solute atoms produce a local 
distortion with cubic symmetry so that no redistribution occurs on 
applying a stress. But in a random distribution of solute atoms there 
will occur pairs of solute atoms which are nearest neighbours to one 
another and the local distortion produced around such pairs will be of 
tetragonal symmetry. On applying a stress the pair axes will tend to 
orient themselves along those crystallographically permissible directions 
which make the least angle with the direction of stress, for by so doing 
the stress is to some extent relieved and the strain energy reduced. 
A pair reorientation is achieved by the migration of either member of 
the pair to a mutual nearest neighbour position, and it is to such a 
process that these particular internal friction effects, observed only in 
alloys, have been attributed. 

Clearly, one would expect the relaxation strength, or the magnitude 
of the internal friction, to be greater, for equal solute concentration, 
for interstitial solid solution in b.c.c. lattices than in f.c.c. substitutional 
solutions, for in the latter case it is only the randomly occurring pairs 
of atoms which contribute to the anelastic effects. Thus, these may be
observed with very low concentration of, for example, C in α-Fe
(0.001 per cent) whereas a concentration greater than ~ 10 per cent is
necessary with substitutional alloys. The larger the difference in size
between solute and solvent atoms the greater the relaxation strength
and so the lower the concentration at which observable effects are
obtained. But, at these high concentrations the concept of the
reorientation of pairs requires considerable modification, for much larger
groupings of solute atoms will occur and isolated pairs will be relatively
uncommon. However, calculations based on the simple pair concept do
seem to provide, from experimental measurements, values of D which,
when extrapolated to the higher temperatures at which conventional
diffusion coefficient measurements are made, agree fairly well with the
values the latter give. This, and other evidence, confirms that the
internal friction effects are intimately associated with the diffusion 
properties of the alloy, but a more careful theoretical treatment would 
be welcome. The anelastic effects in the b.c.c. interstitial solutions are 
more clear in origin and free from such difficulties of theoretical 
interpretation. 

These anelastic effects are characterized by the "relaxation time"
τ_R required for the establishment of the new stress induced equilibrium
redistribution of atoms or pairs of atoms and τ_R is clearly closely related
to τ, the mean time of stay of any one atom in one lattice position. The
diffusion coefficient D for the solute atoms is given by equation (13), so
that a determination of τ_R serves to determine D.

For interstitial solid solution in b.c.c. lattices τ = 3τ_R/2 and
for substitutional solid solution in f.c.c. lattices it is sufficiently accurate
to take τ = τ_R for polycrystalline material, although for single crystals
the relation between τ and τ_R varies with the direction of the stress.

τ_R may be determined by measurements either of the internal friction
or of the elastic after-effect.

The first method consists in measuring the temperature at which the
maximum occurs in a plot of internal friction as a function of
temperature. If ω is the vibrational frequency at which the experiments are
being conducted then τ_R at this temperature = 1/ω. Measurements
of τ_R at other temperatures are made by varying the frequency ω.
With the specimen as a wire forming the suspension of a torsional
pendulum and with photographic recording of the vibrations,
measurements may be made in the range ~ 1 to 100 c.p.s. For higher frequencies
and, therefore, also temperatures, it is more convenient to study the
damping of a bar of material set into flexural oscillations, using
electronic methods of timing.
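The way peak temperatures at two frequencies lead to an activation energy can be sketched as follows. The sketch assumes that τ_R varies with temperature as τ₀ exp(Q/RT), which is not stated at this point in the text but is the usual form when D = D₀ exp(−Q/RT) and D is inversely proportional to τ. The function names and numbers are hypothetical, and ω is used in whatever units make τ_R = 1/ω hold.

```python
import math

R_CAL = 1.987  # gas constant in cal/(mole K)


def relaxation_time_at_peak(omega):
    """At the internal friction maximum, tau_R = 1 / omega."""
    return 1.0 / omega


def jump_time_bcc_interstitial(tau_r):
    """Mean time of stay for interstitials in b.c.c. lattices, tau = 3 tau_R / 2."""
    return 1.5 * tau_r


def activation_energy(omega1, t1, omega2, t2, r=R_CAL):
    """Q from two (frequency, peak temperature) pairs, assuming tau_R = tau0 exp(Q / R T)."""
    return r * math.log(omega2 / omega1) / (1.0 / t1 - 1.0 / t2)


# Hypothetical check: generate peak frequencies from an assumed Q and tau0, then recover Q.
q_assumed, tau0 = 20000.0, 1.0e-14
t1, t2 = 300.0, 350.0
omega1 = 1.0 / (tau0 * math.exp(q_assumed / (R_CAL * t1)))
omega2 = 1.0 / (tau0 * math.exp(q_assumed / (R_CAL * t2)))
q_recovered = activation_energy(omega1, t1, omega2, t2)
```

Raising the measuring frequency moves the peak to a higher temperature, so each frequency supplies one point on the plot of log τ_R against 1/T.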

In the second method, applicable at lower temperatures and for
values of τ_R greater than a minute or so, the specimen in wire form is
twisted and kept so for a time of the order of τ_R. On releasing the
stress the elastic strain is recovered immediately, but the anelastic
strain only gradually and a measure of this strain as a function of time
determines τ_R.

Using both methods values of τ_R from about 1 hr to, say, 10~* sec
(for ω ~ 10 kc.p.s.) may be measured covering 8 cycles of 10 in the
value of D. The corresponding temperatures are such as to extend the
temperature range over which diffusion coefficients may be measured
from the comparatively restricted range covered by the conventional
methods right down to below room temperature for C in α-Fe, for
example, and to within about 200°C of room temperature for alloys
such as CuZn and AgZn.

This very considerable advantage of the method means that very
accurate values of D₀ and Q may be obtained, which, as mentioned on
p. 313, give positive values for the entropy of activation. Kirkendall
or concentration gradient effects are without influence on the results, for
measurements are made, of course, on chemically homogeneous material,
and, therefore, also provide values of D at well-defined compositions.
Grain boundary diffusion plays no part in determining τ_R but, at least
for the substitutional alloys, single crystal or quite coarsely
polycrystalline material must be used to avoid or reduce anelastic effects
associated with grain boundaries such as grain boundary slip, which
might otherwise contribute more appreciably to the observed anelastic
effects than the process being studied.

Fig. 12. Diffusion Coefficient of C in α-Fe (plotted against 1000/T, with the temperature in °C marked).
Conventional Diffusion Measurements: STANLEY
Elastic After-effect Measurements: WERT
Elastic After-effect Measurements: RICHTER
Internal Friction Measurements: WERT
"Precipitation Measurements": WERT (Not described in text. See ref.

For substitutional solid solutions the method has certain limitations. 
The solute and solvent atoms must differ appreciably in size and the 
concentration must not be so low that the number of solute atom pairs is 
inappreciable. 

These methods have been applied most extensively to interstitial 
solid solutions, in particular of C (refs. 10, 56, 57) and N (refs. 12, 56, 58) 
in α-Fe and of O, N and C in Ta, and have provided the accurate values 
of D_0 and Q made use of in the work discussed in Section III. Fig. 12 
shows the results for C in α-Fe, largely by WERT, together with those of 
STANLEY measured by the conventional methods. The extension of 
the measurements to lower temperatures clearly provides for an 
accuracy in D_0 and Q quite unattainable by the conventional methods 
alone. 
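
The gain in accuracy can be illustrated with a small numerical sketch. It assumes the usual Arrhenius form D = D_0 exp(−Q/RT), fits a straight line to log D against 1/T by least squares, and compares the spread in 1/T of a narrow and a wide temperature range (the statistical uncertainty of the fitted slope falls as this spread grows). The parameter values and temperature lists are hypothetical and are used only to generate synthetic data; they are not measurements from the text.

```python
import math

R = 1.987  # gas constant in cal/(mol K)


def arrhenius_fit(temps_K, D_values):
    """Least-squares line through ln D versus 1/T; returns (D0, Q)."""
    xs = [1.0 / T for T in temps_K]
    ys = [math.log(D) for D in D_values]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return math.exp(intercept), -slope * R


def inverse_T_spread(temps_K):
    """Sum of squared deviations of 1/T; a larger value pins the slope down better."""
    xs = [1.0 / T for T in temps_K]
    x_mean = sum(xs) / len(xs)
    return sum((x - x_mean) ** 2 for x in xs)


# Hypothetical parameters (assumed, for illustration only).
D0_TRUE = 0.02      # cm^2/s
Q_TRUE = 20000.0    # cal/mol


def d_true(T):
    return D0_TRUE * math.exp(-Q_TRUE / (R * T))


narrow_range = [800.0, 900.0, 1000.0]           # conventional methods only
wide_range = [250.0, 300.0, 400.0] + narrow_range  # extended to low temperature

D0_fit, Q_fit = arrhenius_fit(wide_range, [d_true(T) for T in wide_range])
spread_gain = inverse_T_spread(wide_range) / inverse_T_spread(narrow_range)
print(f"D0 = {D0_fit:.4g} cm^2/s, Q = {Q_fit:.0f} cal/mol, spread gain = {spread_gain:.0f}x")
```

For equal scatter in log D, the standard error of the fitted slope varies inversely as the square root of this spread, which is why extending the range downwards in temperature is so effective.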

Comparatively little use has yet been made of the method with 
substitutional solid solutions. NOWICK has studied the diffusion of 
Zn (15 per cent) in Ag and LE CLAIRE and CHILDS of Zn (10-30 per 
cent) in Cu, and in each case the results give a very accurately linear 
relation between log D and 1/T, which when extrapolated to high 
temperatures gives values of D agreeing fairly well with those obtained 
by conventional methods. Earlier measurements of the internal 
friction in 30 per cent Zn α-brass have since been made use of to give 
values for D. An interesting result of the CuZn experiments is that 
between 15 and 30 atomic per cent Zn, the activation energy is fairly 
constant at 38,000 cal/mole, contrary to previous measurements 
which show a marked variation. D increases with composition but not 
so rapidly as indicated by RHINES and MEHL. 

An interesting use of this method, stemming from the short times and 
low temperatures at which it allows diffusion coefficients to be 
measured, has already been described (p. 303). 

These advantages arise, of course, because the internal friction 
methods of measurement only require the diffusion of atoms over 
distances of the order of the atomic spacing, whereas the conventional 
methods require diffusion distances of ~ 1/mm or so. In order to ob- 
tain the latter in a reasonable time, experiments have to be conducted 
at high temperatures where the jump frequency is large. 
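
The size of this advantage can be estimated from the order-of-magnitude relation t ≈ x²/D between diffusion distance x and time t. The sketch below is only illustrative: the atomic spacing, the macroscopic distance and the value of D are assumed round numbers, not values taken from the text.

```python
def diffusion_time(distance_cm, D_cm2_per_s):
    """Order-of-magnitude time for diffusion over a given distance, t ~ x^2 / D."""
    return distance_cm ** 2 / D_cm2_per_s


# Assumed illustrative values.
ATOMIC_SPACING_CM = 3.0e-8      # of the order of the atomic spacing
CONVENTIONAL_CM = 0.1           # a macroscopic diffusion distance
D_LOW_TEMPERATURE = 1.0e-17     # cm^2/s, a hypothetical low-temperature value

t_internal_friction = diffusion_time(ATOMIC_SPACING_CM, D_LOW_TEMPERATURE)
t_conventional = diffusion_time(CONVENTIONAL_CM, D_LOW_TEMPERATURE)
time_ratio = t_conventional / t_internal_friction

print(f"atomic-scale time  : {t_internal_friction:.3g} s")
print(f"macroscopic time   : {t_conventional:.3g} s")
print(f"ratio of the times : {time_ratio:.3g}")
```

Because the time varies as the square of the distance, the ratio of the two times is independent of D and depends only on the ratio of the two distances.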

The internal friction methods are therefore of importance in any 
study of the effect on diffusion rates of any condition of the material 
which is appreciably influenced by the temperatures at which conven- 
tional measurements are made, in a time short compared with that 
required for their execution. Such conditions are the presence of cold 
work and of radiation damage as well as of a non-equilibrium vacancy 
concentration. 


SECTION V 


THERMODYNAMIC AND KINETIC THEORY OF CHEMICAL DIFFUSION 


Darken’s Equations 

DARKEN, it will be remembered, in order to provide a phenomenological 
description of binary diffusion which would embrace the Kirkendall 
effect, proposed that a separate and independent diffusion coefficient 
should be ascribed to each of the two diffusing species, defining their 
respective rates of flow relative to a system of markers fixed in the 
system. It is usual to assume that the markers define lattice planes, so 
if we generalize DARKEN’s proposal, the rates of flow J_i relative to the 
lattice of the several species of an n component system are supposed 
given by 

$$J_i = -D_i\,\partial c_i/\partial x \tag{39}$$


In the case of a binary alloy these equations lead, subject to the 
previously discussed assumptions, to equations (1) and (2) of Section I. 

Now equation (39) implies that the necessary condition for diffusion 
to occur between two points is that there exists a concentration 
gradient between them and assumes the simplest mathematical 
expression of this fact by putting the rate of flow proportional to the 
concentration gradient. The constants of proportionality D_i then characterize 
the diffusion behaviour of the system. Although this is a reasonable 
deduction from experience, it is thermodynamically more sound to 
expect that the essential condition for diffusion of any one species to 
occur between two points is that there should exist between them a 
difference in the chemical potential of that species, for a system is only 
in equilibrium when the chemical potential of each species present is the 
same throughout the system. The simplest mathematical expression 
of this fact is to assume the rate of flow of any one species to be 
proportional to the gradient of its chemical potential μ_i. The rates of flow of 
the several species are then to be given by the equation 

$$J_i = -L_i\,\partial \mu_i/\partial x \tag{40}$$


The diffusion behaviour of each species may now be specified by the 
corresponding coefficient L_i but it is more common practice to retain 
equation (39) as a definition of the diffusion coefficients and to express 
each D_i in terms of the quantities appearing in equation (40). In this 
way we find that (see PMP.1) 

$$D_i = \frac{L_i}{c_i}\,\frac{\partial \mu_i}{\partial \log c_i} = RT\,\frac{L_i}{c_i}\left(1 + \frac{\partial \log \gamma_i}{\partial \log c_i}\right) \tag{41}$$

where use has been made of the usual expression, μ_i = const + RT 
log c_i γ_i, for μ_i in terms of the activity coefficient γ_i. The term L_i/c_i 
corresponds to the “mobility” G_i in terms of which the phenomenological 
theory of diffusion was discussed in PMP.1. 

These phenomenological equations (40) and (41) are frequently used 
in preference to equation (39) in dealing with diffusion, particularly in 
liquids, and although their use in describing metallic diffusion processes 
has been suggested from time to time in the literature, little application 
appears to have been made of them until the work of BIRCHENALL and 
MEHL and of DARKEN. 
The first important application of these equations, by DARKEN, was 
to provide a relation between the individual chemical diffusion 
coefficients D_i and the corresponding self-diffusion coefficients D_i*. For this 
purpose DARKEN assumes that the mobility L_i/c_i, defining the average 
velocity of flow by diffusion under unit free energy gradient (see 
PMP.1), is the same, for any one chemical species, under conditions of 
both self-diffusion and chemical diffusion, so that differences between 
D_i and D_i* arise entirely from differences in the chemical potential 
gradient in the two cases. In self-diffusion the chemical potential 
gradient involves only the entropy of mixing so that 

$$D_i^* = RT\,L_i/c_i \tag{42}$$

With the above expression for D_i, we then find 

$$D_i = D_i^*\left(1 + \partial \log \gamma_i/\partial \log c_i\right) \tag{43}$$

relating D_i and D_i*. 

For the special case of a binary diffusing system we may now relate 
D_1* and D_2* with the chemical interdiffusion coefficient D and the 
rate of marker movement v (equations (1) and (2)) by making use 
of the Gibbs-Duhem relation for a binary system, 

$$\partial \log \gamma_1/\partial \log c_1 = \partial \log \gamma_2/\partial \log c_2 .$$

Thus 

$$D = (N_2 D_1^* + N_1 D_2^*)\left(1 + \partial \log \gamma_1/\partial \log c_1\right) \tag{44}$$


and 

$$v = (D_1^* - D_2^*)\left(1 + \partial \log \gamma_1/\partial \log c_1\right)\partial c_1/\partial x \tag{45}$$
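
As a numerical illustration of these relations, the sketch below evaluates the thermodynamic factor, the interdiffusion coefficient and the marker velocity for a binary alloy. The regular-solution form log γ_1 = Ω N_2²/RT is an assumption introduced here purely to have a closed expression for the thermodynamic factor; it is not taken from the text, and the numerical inputs are hypothetical. The composition gradient is taken as an atomic fraction per unit length.

```python
R = 8.314  # gas constant in J/(mol K)


def thermodynamic_factor(N1, omega, T):
    """1 + d(log gamma_1)/d(log N_1) for an assumed regular solution,
    log gamma_1 = omega * N_2**2 / (R*T); symmetric in the two components."""
    return 1.0 - 2.0 * omega * N1 * (1.0 - N1) / (R * T)


def interdiffusion_coefficient(N1, D1_star, D2_star, phi):
    """Chemical interdiffusion coefficient from the self-diffusion coefficients."""
    N2 = 1.0 - N1
    return (N2 * D1_star + N1 * D2_star) * phi


def marker_velocity(D1_star, D2_star, phi, composition_gradient):
    """Marker velocity for a given gradient of atomic fraction of species 1."""
    return (D1_star - D2_star) * phi * composition_gradient


# Hypothetical inputs for an equiatomic alloy.
N1 = 0.5
D1_STAR, D2_STAR = 2.0e-10, 1.0e-10   # cm^2/s
T = 1200.0                            # K
OMEGA = -15000.0                      # J/mol, assumed interaction parameter

phi = thermodynamic_factor(N1, OMEGA, T)
D_chem = interdiffusion_coefficient(N1, D1_STAR, D2_STAR, phi)
v = marker_velocity(D1_STAR, D2_STAR, phi, composition_gradient=-5.0)  # per cm

print(f"thermodynamic factor = {phi:.3f}")
print(f"D = {D_chem:.3e} cm^2/s, v = {v:.3e} cm/s")
```

With Ω = 0 the solution is ideal, the thermodynamic factor is unity, and the chemical and self-diffusion coefficients of each species coincide.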


In support of equation (44) DARKEN quoted the earlier results of 
JOHNSON, who measured the self-diffusion coefficients D_Au* and D_Ag* 
of gold atoms and silver atoms respectively in a chemically 
homogeneous alloy of 50 per cent composition, and also the chemical 
interdiffusion coefficient D at the same composition. For this latter purpose a 
diffusion couple was used, the compositions of the two halves of which 
differed only slightly from their mean of about 50 per cent. DARKEN 
showed that JOHNSON’s values of D_1*, D_2* and D were satisfactorily 
related by equation (44) with N_1 = N_2 = 1/2 and with the already known 
thermodynamic data on Au, Ag alloys being used to calculate the last 
factor. Equations (39), (40), (41) and (43) as well as (1) and (2) are 
generally referred to in the metallurgical literature as Darken’s 
equations, although most of them appear to have been proposed 
independently by workers in other fields. 

More recently LE CLAIRE and BARNES have pointed out that the 
marker shift actually observed in Au, Ag alloys by CORREA DA SILVA 
and MEHL agrees quite well with that calculated from equation (45), 
using JOHNSON’S measurements of D_1* and D_2*. This calculation 
required a rough estimate of the value of ∂c_1/∂x in CORREA DA SILVA 
and MEHL’s experiments, which was not reported by them. SEITH et 
al. (ref. 128) actually measured the marker shift in a Au/Ag couple identical 
with the one used by JOHNSON and used his reported data to give the 
exact value of dc,/dx. Their observed shift of 2-90 .10-% cm agrees 
fairly well with the value of 3-10 . 10-* cm calculated from equation (45). 

These measurements on Au/Ag alloys do then appear to provide a 
fairly satisfactory check of DARKEN’s equations but more extensive 
experimental work of a similar nature would be very welcome and, 
indeed, necessary to establish the equations firmly. A case might be 
made that any discrepancy between experiment and calculation based 
on Darken’s equations was to be attributed to uncertainties in the 
measured value of the chemical diffusion coefficients arising from 
porosity and lateral changes in the dimensions of diffusion samples as 
discussed in Section I. Thus SEITH and KOTTMANN point out that the 
observed value of v in their Au/Ag samples is less than the calculated 
value, as is to be expected if the discrepancy arises from these causes. 

The relation between D_i* and D_i is important for many reasons. It 
provides the essential link between the theoretically most simple and 
fundamental quantity underlying all diffusion phenomena, namely the 
average number of random jumps made per second by an atom in a 
homogeneous alloy, derived from D_i*, and the practically significant 
rates of chemical diffusion given by D_i. The form of the relations will, 
in general, vary with the mechanism of diffusion and so a choice in 
principle may be made between proposed mechanisms by experimental 
study of the corresponding relations (see PMP.1). 

A number of recent papers!®. 5°, 2°, 120, 121 have been concerned with 
the validity of DARKEN’s equations and of the assumptions upon which 
they are based. One important feature omitted from DARKEN’s 
treatment of diffusion is the part played by the vacancies, when diffusion 
occurs by this process. There is a definite nett flux of vacancies as the 
Kirkendall experiments show and it is natural to inquire to what extent 
this vacancy flow affects the validity of equations (39) and (40). It is by 
no means apparent why, as equation (40) states, the rate of flow of 
atoms of one species should be proportional only to the chemical 
potential gradient of atoms of that species when there is a nett flow of 
the vacancies whose movement alone can lead to a flow of the atoms, or 
to put it another way, when there is such an intimate coupling between 
the flow of atoms and of vacancies. 

A study of the problem is conveniently made in terms of ONSAGER’S 
general theory of irreversible phenomena. 


General Thermodynamic Theory of Diffusion 
The essential assumption is that the rate of flow J_i of each of the 
several n species present in a multicomponent system is linearly 
dependent upon the chemical potential gradient of each of the species 
present, including the vacancies, so that we have as the “thermodynamic 
equations of motion” 

$$J_i = \sum_{j=1}^{n} L_{ij} X_j \tag{46}$$

where for short we write X_j for the negative of the chemical potential 
gradient of the jth species, X_j = −∂μ_j/∂x. 

The terms L_ii correspond closely to the terms L_i in equation (40) 
and, therefore, to the diffusion coefficients but the “cross-terms” 
L_ij (i ≠ j) take account of possible couplings between the several 
diffusion flows. It may be shown that as a consequence of the principle 
of microscopic reversibility, a symmetry exists between these cross 
terms in equation (46) expressed by the “Onsager Relations” 

$$L_{ij} = L_{ji} \tag{47}$$

(the same is not in general true when equation (46) is rewritten with 
∂c_j/∂x in place of ∂μ_j/∂x). 

The L_ij are functions of composition and will in general vary 
according to the choice of plane with reference to which the J_i are measured. 
Let this plane be such that the nett flow of all species across it is zero. 
Then 

$$\sum_{i=1}^{n} J_i = 0 \tag{48}$$

Since for each atom moving from one lattice plane to the next there 
is a vacancy moving in the other direction, equation (48) is satisfied 
for a plane fixed relative to the lattice, as required for comparison of the 
J_i given by equation (46) with those given by equation (39) or (40). 
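
What the zero-nett-flow condition demands of the coefficients can be checked numerically. The sketch below builds a symmetric coefficient matrix for three species (A atoms, B atoms and vacancies) in which every row and column sums to zero, and confirms that the fluxes computed from the linear flow equations then add up to zero whatever forces are applied. The coefficient values and forces are arbitrary illustrative numbers, and fixing the diagonal terms from the off-diagonal ones is a construction chosen here for the demonstration.

```python
def onsager_matrix(l_ab, l_av, l_bv):
    """Symmetric 3x3 coefficient matrix for (A, B, vacancies) whose rows and
    columns each sum to zero; the diagonal terms follow from the cross terms."""
    return [
        [-(l_ab + l_av), l_ab, l_av],
        [l_ab, -(l_ab + l_bv), l_bv],
        [l_av, l_bv, -(l_av + l_bv)],
    ]


def fluxes(L, X):
    """Linear flow equations: J_i = sum over j of L_ij * X_j."""
    return [sum(L_ij * X_j for L_ij, X_j in zip(row, X)) for row in L]


# Arbitrary illustrative cross terms (negative, so the diagonal terms are positive).
L = onsager_matrix(l_ab=-0.2, l_av=-3.0, l_bv=-1.0)
X = [0.7, -0.4, 0.1]   # arbitrary forces on A, B and vacancies

J = fluxes(L, X)
print("fluxes:", J)
print("sum of fluxes:", sum(J))
```

Adding the same constant to all three forces leaves every flux unchanged, so only differences such as X_A − X_v enter the flows.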

In addition to the ‘Onsager relations,’’ equation (48) leads to a further 
reduction in the number of independent coefficients L,, for it may very 
simply be shown’ to lead to the result 


(49) 


(50) 
Let us now regard the vacancies as the nth component and put n = v. 


With the aid of equation (49) we may now rewrite the equation (46) 
for J_i as 

$$J_i = \sum_{j=1}^{n-1} L_{ij}\,(X_j - X_v) \tag{51}$$

where −X_j and −X_v are the chemical potential gradients of the jth 
component and of the vacancies respectively. 
The equations (51) give in the most convenient general form the rates 
of flow by diffusion of the vacancies and of the n − 1 atomic species. 
If the density of vacancies at each point in the system is such as 
corresponds to them being in equilibrium with the lattice, then the 
free energy of the vacancies is a minimum for small changes in their 
number, so that μ_v and also X_v = 0. Equation (51) now becomes 

$$J_i = \sum_{j=1}^{n-1} L_{ij} X_j \tag{52}$$

and, if the cross terms are neglected as well, we arrive at 

$$J_i = L_{ii} X_i \tag{53}$$


Neglecting cross terms but not assuming the vacancies are in 
equilibrium we find 

$$J_i = L_{ii}\,(X_i - X_v) \tag{54}$$


Equation (53) proposes that each J_i is determined solely through the 
corresponding X_i. Such phenomenological equations are identical with 
equation (40) on page 321. We now see that their validity rests on the 
assumption that the vacancies are everywhere maintained in thermo- 
dynamic equilibrium with the lattice and that the cross terms, which 
give a contribution to the flow of each component proportional to the 
chemical potential gradient of each of the other components, are 
negligible. 

It may well be in many cases that the vacancies are everywhere in 
thermal equilibrium so that at least equation (52) is valid. This will 
certainly be true in self-diffusion experiments, where chemical homo- 
geneity is maintained and also in chemical diffusion experiments if there 
are sources or sinks for vacancies, such as dislocations, adequate to 
maintain equilibrium. 

As regards the cross terms, at least those representing coupling 
between the flow of vacancies and the flows of the several atomic species 
are not to be neglected, as is to be expected from the very nature of 
vacancy diffusion. The nett flow of vacancies is certainly not zero, as the 
Kirkendall effect shows, and being given by 

$$J_v = \sum_{j=1}^{n-1} L_{vj} X_j$$

may be regarded as determined entirely by the cross terms L_vj (j ≤ n − 1). 
The fuller and more general phenomenological theory expressed by 
equation (46) is then necessary because of the observed nett vacancy 
flow. 

But no convincing demonstration has yet been given of the part 
played in vacancy diffusion by the cross terms between different 
atomic species, and in particular of whether they are zero or sufficiently 
small to be neglected. Obviously, for an exchange or ring diffusion 
mechanism, the cross terms would certainly be significant. 

It is to be noted that a simple diffusion coefficient D_i as given by 
equation (39) can only in general be defined for each species when cross 
terms are neglected and equation (53) is valid. With non-zero cross 
terms it is not possible in an n component system to obtain any 
equation corresponding to equation (43). 

However, in the simple but important special case of binary diffusion 
the rate of flow of each species may, when the vacancies are in equili- 
brium, be written as proportional only to its own concentration gradient 
even when cross terms are not ignored. We may, therefore, obtain 
diffusion coefficients for the two species, so that the use of DARKEN’s 
equations (1) and (2) is consistent with this more general theory. But 
DARKEN’S relation (43) and, therefore, (44) and (45) only follows when 
cross terms are zero. These results are simply shown as follows. 

In a measurement of the self-diffusion coefficient of one component 
of a binary alloy we are dealing with a system of four identifiable 
species, inactive A atoms, active A* atoms, inactive B atoms, and 
vacancies. Denoting these four species respectively by suffixes 1, 2, 3, 4 
the “thermodynamic equations of motion” are’! 


A self-diffusion coefficient measurement requires a determination of 
J_2 in a chemically homogeneous alloy of A and B, i.e. in which X_3 = 0 
and c_1 + c_2 = c_A, the uniform and constant total concentration of A 
atoms. If we assume the vacancies are in equilibrium, X_4 = 0 and 

$$J_2 = L_{21} X_1 + L_{22} X_2 \tag{56}$$


The Gibbs-Duhem relation in the form Σ c_i X_i = 0 enables us to 
eliminate X_1 from (56) and so obtain J_2 in the required form, from which 
we find for the self-diffusion coefficient of A atoms in the alloy D_A*, 


D,* = — RT | Ly — 2 Lys) / ¢ ote wa ee 


The further conditions that J_1 + J_2 = 0 and J_3 = 0 lead to the relations 

$$c_2 L_{31} - c_1 L_{32} = 0 \tag{58}$$

and 

$$c_2 (L_{11} + L_{21}) - c_1 (L_{22} + L_{12}) = 0 \tag{59}$$
Use of (59) enables us to write D_A* in the more symmetric form 
D,* = — RT {Ly + Los a 


A measurement of the chemical diffusion coefficient of A requires a 
determination of the total flow J_1 + J_2 of A in an alloy in which a 
chemical concentration gradient of A exists. In this case, not the sum of 
c_1 and c_2 but the ratio c_1/c_2 remains constant at every point. Thus, as 
c_1 and c_2 are concentrations of chemically identical species, 

$$c_1 X_1 + c_2 X_2 = c_A X_A \tag{61}$$

where −X_A is the total chemical potential gradient of A atoms, and 

$$X_1 = X_2 = X_A .$$


With these and the Gibbs-Duhem relation we find for D, 


o log Val 


D, = — RT {(Ly + Le 


a 22 


7 21,2) Ca 


2 log c,) 
(62) 
From (60) and (62) the general relation between D, and D,* becomes,*° 
using (58), 


RT c, 


Cc 


(c,Le, — C3, 


> 
. 


\ | , :, 
oe 
J ( 2 log c,) aoe 


D,* + 
Cole 

It is clear that this is equivalent to DARKEN’s relation between D, 
and D,* only when L,,/c, = L,./c, or when the cross terms are both 
zero. The first condition would mean that the diffusion behaviour of the 
two species of atoms are identical whereas the Kirkendall effect clearly 
shows that such is not the case. (The analogous equation (58) is a direct 
consequence of the identity of A and A* atoms.) This condition would, 
of course, hold for a mixture of three identical isotopes; since the 
thermodynamic factor is unity in this case we arrive at the trivial 
result D, = D,*. 

We have seen that there is a little experimental evidence in support 
of equation (43), which therefore suggests that the cross terms may be 
unimportant. But a theoretical justification of this conclusion, indi- 
cating that cross terms are, in fact, zero or at least sufficiently small to 
be neglected, can only come through a detailed study by kinetic methods 
of the process of vacancy diffusion in binary alloys. Although several 
attempts have been made to construct such a theory, the problem is not 
yet sufficiently clearly or convincingly resolved to warrant a full 
review of them. 

It is interesting to note in passing that the cross terms do play an 
important part in the phenomena of diffusion in strong electrolytes." 


Kinetic Theories of Diffusion 

BARDEEN has shown how equation (54) may be derived by simple 
kinetic considerations, so that DARKEN’s equation (43) again follows if 
the vacancies are in equilibrium, but the assumptions introduced into 
the analysis have been criticized by SEITZ, who regards them as 
“exceedingly restrictive.” BARDEEN and HERRING in a later paper 
claim to have avoided the more important of these assumptions, 
without influence on the results, but their treatment is not without 
objections. SHUTTLEWORTH has considered a model in which one species 
of atom exchanges very much more readily with the vacancies than does 
the other, and finds a relation between D_i and D_i* which differs from 
equation (43) in that the factor (1 + ∂ log γ_i/∂ log c_i) is replaced by 
(2 + ∂ log γ_i/∂ log c_i) so that cross terms are clearly of importance in 
such a case. 

It would hardly be profitable in the present incomplete state of the 
subject, to describe all these various attempts at the problem in detail, 
but SEITZ’s more general approach may be mentioned for it reveals 
in a fairly simple way and with the fewest assumptions, the requirement 
that the vacancies must be in equilibrium for the rates of flow of the 
two species to be expressible by diffusion coefficients. 

SEITZ was the first clearly to point out that the Kirkendall effect was 
consistent with a vacancy mechanism of diffusion if it was assumed that 
vacancies exchanged more readily with one type of atom than with the 
other. He introduces the quantity τ_a(ξη) defining the probability that 
a vacancy on a plane where the composition is ξ will exchange in unit 
time with an A atom on a neighbouring plane where the composition is η. 
A similar quantity τ_b(ξη) defines the corresponding probability for a 
vacancy exchange with a B atom. The total probability that in unit 
time a vacancy will jump from plane 1 to plane 2 is clearly τ_a + τ_b, 
which sum we denote by τ. 

We suppose diffusion to occur between two neighbouring planes 
1 and 2 separated by a distance λ and whereon respectively the planar 
densities of A atoms, B atoms, and vacancies are n_a1, n_a2; n_b1, n_b2; n_v1, n_v2. 
If the total number of sites n remains constant during the diffusion and 
since n_v is very small the composition on each plane is completely 
determined when only n_a1 and n_a2 are given. 

The nett flow of vacancies from plane 1 to plane 2 is 

$$J_v = n_{v1}\,\tau(n_{a1} n_{a2}) - n_{v2}\,\tau(n_{a2} n_{a1})$$





Since n_a1 and n_a2 differ only slightly from their mean value of n_a, τ 
may be expanded to the first order in λ to give 


F dn, . On  dn\ . dn, 
J, = — 2(n,n,) A+n,(— — == —— | A - 
ae , oF cn dx 













We thus see that even when there is no concentration gradient of 
vacancies a flux of vacancies exists solely on account of the chemical 
concentration gradient. 

The flow of A atoms is clearly 

$$J_a = n_{v2}\,\tau_a(n_{a2} n_{a1}) - n_{v1}\,\tau_a(n_{a1} n_{a2})$$


which we may write as 


, dn, 
A om 


(64) 


If the vacancies are everywhere in thermodynamic equilibrium with 
the lattice, then the vacancy concentration is a function only of chemical 
composition and we may write 


dn, 


J, in equation (64) may now be written proportional to the concen- 
tration gradient of A atoms, so we may obtain the chemical diffusion 
coefficient D, as 

dn, 


= — 7,(n,n,) —— 2? —j}42 ... . (65) 
dn, 


a 
where n_a/λ has been replaced by c_A, the volume concentration of A 
atoms. 

We see then again that equation (39) and, therefore, also (1) and (2) 
are valid for binary diffusion only when the vacancies are maintained 
in equilibrium. 

It is readily shown that D,* is given by 


a? 


D,* = n,7,(nn,) . 22/n, a = Se 


so that the expressions for D_A and D_A* are intrinsically different in form, 
for D_A* involves only τ_a whereas D_A involves also its first derivatives. 

This represents the least objectionable justification by kinetic 
methods available so far for the use of equations (1) and (2) in describing 
binary diffusion; it is only when simplifying assumptions are made as 
to the form of τ that a relation between τ and its derivatives and, 
therefore, between D_A and D_A* may be obtained. It is, in effect, at 
this stage, in the choice of simplifying assumptions, that the various 
attempts at constructing a kinetic theory of diffusion diverge. And 
none of these assumptions is sufficiently unrestricted to lead to a 
convincingly general result. 
NUCLEATION 


J. H. Hollomon and D. Turnbull 


INTRODUCTION 


In a large number of chemical reactions there are no recognizable 
centres at which the reaction begins and from which it propagates. In 
contrast to this mode of reaction most phase transformations begin at 
certain sites and propagate from these places. An element of the system 
undergoes no recognizable change until it is swept over by a boundary 
between the transformed and untransformed phases propagating from 
the centre at which the transformation began. 

The initiation of transformation at certain recognizable centres is 
known as nucleation, and the propagation of the transformation from 
the centres is called growth. Even though the nucleation of transforma- 
tions has been recognized and described for at least two centuries, until 
recent years it was difficult to make reproducible and quantitative 
measurements of the rate of nucleation. During the past two decades 
semi-quantitative measurements have been made on the rate of nuclea- 
tion of liquid droplets in supersaturated vapours. These measurements 
largely confirmed the predictions of the theory of homogeneous 
nucleation developed by VOLMER, BECKER and co-workers. More recently the 
rate of nucleation in supercooled liquids has been measured quantita- 
tively. Also our insight into the nature and mechanism of nucleation 
catalyzed by extraneous surfaces has been deepened by recent experi- 
mental and theoretical developments. Even though the understanding 
of nucleation in crystalline media is much less advanced than for 
gaseous and liquid media, an important theoretical groundwork for this 
understanding has been developing. 

These advances in the understanding of nucleation have spurred great 
experimental and theoretical activity in the past few years. 
Descriptions of some of this activity are given in recent reviews by 
SMOLUCHOWSKI, TURNBULL and HOLLOMON, HOLLOMON, BRADLEY, and 
LA MER, POUND and REISS. 

In this paper we shall critically review the subject of nucleation in 
general but particularly emphasize the rate of nucleation in reactions 
involving crystalline phases. 

Consider the formation of a phase β within a parent phase α by 
the process of nucleation and growth. If the transformation occurs 
isothermally the size of each β region, D, is some function of time, 
D = f(t). If no change in composition accompanies the transformation, 
β propagates at a constant rate, G, after nucleation and D = G(t − τ), 
where τ, usually a positive intercept on the time axis, is the nucleation 
period. Of course, all regions of β do not nucleate at the same time, so to 
describe the course of the transformation we must know more than one 
value of τ. It is most convenient to define the nucleation frequency, I, 
which is the number of β regions formed per unit of time per unit of 
untransformed volume. I is reciprocally related to a mean value of 
τ = τ̄. To measure I experimentally we effectively must measure 
D = f(t) for a large number of β regions at a given stage in the 
transformation, extrapolate all these to zero, and determine the number, N, 
of intercepts per unit time. Then I = N/V, where V is the 
untransformed volume corresponding to the particular time. In order for the I 
value to be of significance V must be very large. Where a composition 
change accompanies transformation, as in precipitation, often it is 
found that the growth of β is diffusion limited and D = G^{1/2}(t − τ)^{1/2}. 
For such transformations I is determined similarly. 

The kinetics of a large number of isothermal transformations, 
probably the majority, can be satisfactorily described by the two 
parameters I and G. Knowing I and G and taking account of the 
geometrical problem of the impingement of growing β regions, the 
dependence upon time of the fraction of the α sample transforming to 
β may be derived. Derivations of this dependence have been given by 
JOHNSON and MEHL and AVRAMI (now MELVIN). These derivations 
recently have been reviewed critically by BURKE and TURNBULL. 
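
For the simplest case the result of such a derivation can be written down and evaluated directly. The sketch below assumes a constant nucleation frequency I, a constant linear growth rate G, spherical β regions and random nucleation throughout the volume, for which the transformed fraction is X(t) = 1 − exp(−(π/3) I G³ t⁴). The numerical values of I and G are hypothetical.

```python
import math


def fraction_transformed(t, I, G):
    """Transformed fraction for constant nucleation frequency I (per volume per
    time) and constant radial growth rate G, with impingement of spherical
    regions accounted for: X = 1 - exp(-(pi/3) * I * G**3 * t**4)."""
    extended_volume = (math.pi / 3.0) * I * G ** 3 * t ** 4
    return 1.0 - math.exp(-extended_volume)


def half_time(I, G):
    """Time at which half of the sample has transformed."""
    return (3.0 * math.log(2.0) / (math.pi * I * G ** 3)) ** 0.25


# Hypothetical values: I in cm^-3 s^-1, G in cm/s.
I_ASSUMED = 1.0e3
G_ASSUMED = 1.0e-4

t_half = half_time(I_ASSUMED, G_ASSUMED)
for factor in (0.25, 0.5, 1.0, 1.5, 2.0):
    t = factor * t_half
    print(f"t = {t:8.1f} s  X = {fraction_transformed(t, I_ASSUMED, G_ASSUMED):.4f}")
```

The exponent (π/3) I G³ t⁴ is the volume fraction the regions would occupy if they grew without impinging on one another; the exponential corrects for overlap.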

Examples of phase transformations satisfactorily described as 
nucleation and growth processes are vapour → liquid, vapour → crystal, 
liquid → crystal, precipitation, allotropic transformation, and in a few 
instances disorder → order. A variety of processes that are not phase 
transformations in the ordinary sense can also be described by 
nucleation and growth kinetics. An example is the recrystallization of cold 
worked metals treated in detail in an earlier review. 

Nucleation that takes place at random within homogeneous elements 
of volume of a phase is designated homogeneous nucleation. Experience 
indicates that transformation nuclei often form preferentially at various 
interfaces in the system, such as free surfaces, internal surfaces, con- 
tainer walls, suspended crystallites, etc. Such interfaces are said to 
catalyze nucleation and this process of nucleation is called heterogeneous 
nucleation. 

We will describe and explain nucleation in phase transformations 
categorized according to its complexity. In many phase changes, such 
as condensation of a pure vapour or crystallization of a supercooled 
liquid, only a change in structure is involved. The crystallization of a 
phase from a solution involves a change in structure and composition. 
Volume changes accompany most phase transformations; therefore, 
when a phase forms in a crystalline medium strain energy is developed 
due to the transformation. If no composition change is involved, as in 
allotropic transformations in pure crystals, there is structure and strain 
to be considered. The precipitation of a phase from a solid solution is 
the most complex type of transformation in that it is accompanied by 
changes in structure, composition and strain. 

STRUCTURE 
Nucleation of Liquid from Vapour 
General Theory—Even though our primary purpose in writing this 
review is to describe and explain nucleation in condensed metal systems, 
it is desirable to re-examine the theory for nucleation of liquids from 
supersaturated vapours and its experimental verification. The theories 
for the nucleation of a liquid from a supersaturated vapour have 
influenced the development of theories for nucleation in condensed 
systems. 

The theory for homogeneous nucleation of liquids in vapours as now 
generally accepted is largely due to VOLMER and WEBER and BECKER 
and DÖRING.† VOLMER and WEBER supposed that dense clusters of 
molecules exist in equilibrium in the vapour and that some of these 
clusters (called embryos) may become nuclei for the growth of liquid 
droplets. The essential assumption of the theory is that the standard 
free energy, ΔG, of these embryos can be calculated as though they are 
liquid droplets having the same liquid-vapour energy per area, σ, and 
the same free energy per volume, ΔG_v, with reference to the vapour 
phase, as the macroscopic liquid phase. Thus, for spherical embryos of 
radius, r, at a given temperature and external pressure, 

$$\Delta G = 4\pi r^2 \sigma + \tfrac{4}{3}\pi r^3\,\Delta G_v \tag{1}$$

where σ and ΔG_v are independent of r. 


In general, 

$$\Delta G = a\sigma i^{2/3} + b\,\Delta G_v\, i \tag{2}$$

where i is the number of molecules per cluster and a and b are factors 
determined by the shape and density of the cluster. 

A second assumption of the theory is that embryos are born, grow 
to nuclei, or disappear by a sequence of bimolecular reactions. 
Designating α as a molecule of vapour and β_i an embryo of liquid containing 
i molecules, these sequences of bimolecular processes can be represented 
in the following way: 

$$\beta_{i-1} + \alpha \rightleftharpoons \beta_i, \qquad \beta_i + \alpha \rightleftharpoons \beta_{i+1} \tag{3}$$

† Called hereafter the V.W.B.D. theory. 
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With these assumptions the absolute frequency of nucleation per 
volume, J, of liquid nuclei in supersaturated vapours can be derived. 

Let the number of embryos per volume containing $i$ molecules in
equilibrium with vapour at a given pressure $P$ and absolute temperature
$T$ be $n_i$. Then assuming ideal mixing of embryos of all sizes and that
$n_1 \gg \sum_{i=2}^{\infty} n_i$, where $n_1$ is the number of single
molecules per volume, it follows that:

$$n_i = n_1 \exp[-\Delta G/kT]. \tag{4}$$
When $\Delta G_v < 0$, $\Delta G = f(r)$ (equation (1)) has a maximum
(see Fig. 1):

$$\Delta G^* = \frac{16\pi\sigma^{3}}{3(\Delta G_v)^{2}}, \tag{5}$$

at

$$r = r^* = -\frac{2\sigma}{\Delta G_v}. \tag{6}$$

Fig. 1. Free energy of an embryo of a new phase as a function of its
radius

Embryos having a radius smaller than this critical value will on the
average disappear, but those larger will grow into liquid droplets. An
embryo having the critical radius is by definition a nucleus. The number
of embryos per volume-time becoming nuclei is the nucleation
frequency,† $I$.
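The relation between equations (1), (5) and (6) can be checked
numerically. The following is a minimal sketch, assuming SI units and
purely illustrative values of $\sigma$ and $\Delta G_v$ (they are not
taken from the text); the function names are hypothetical.

```python
import math

def embryo_free_energy(r, sigma, dG_v):
    """Equation (1): free energy of a spherical embryo of radius r."""
    return 4.0 * math.pi * r**2 * sigma + (4.0 / 3.0) * math.pi * r**3 * dG_v

def critical_radius(sigma, dG_v):
    """Equation (6): r* = -2*sigma/dG_v (meaningful only for dG_v < 0)."""
    return -2.0 * sigma / dG_v

def critical_barrier(sigma, dG_v):
    """Equation (5): dG* = 16*pi*sigma^3 / (3*dG_v^2)."""
    return 16.0 * math.pi * sigma**3 / (3.0 * dG_v**2)

# Assumed illustrative inputs (SI units), not values from the text.
sigma = 0.075   # J/m^2
dG_v = -1.8e8   # J/m^3

r_star = critical_radius(sigma, dG_v)
print("r*  =", r_star, "m")
print("dG* =", critical_barrier(sigma, dG_v), "J")
print("dG(r*) from equation (1) =", embryo_free_energy(r_star, sigma, dG_v), "J")
```

Evaluating equation (1) at $r^*$ reproduces the barrier of equation (5),
and radii slightly smaller or larger than $r^*$ give a lower free
energy, as expected at a maximum.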
$I$ is the net frequency per volume of the bimolecular process:

$$\beta_{i^*} + \alpha \rightleftharpoons \beta_{i^*+1}, \tag{7}$$

where $i^*$ is the number of molecules in the nucleus.

Assuming with VOLMER and WEBER an equilibrium distribution of
embryos of all sizes, the number of nuclei per volume, $n_{i^*}'$, is
given by:

$$n_{i^*}' = n_{i^*} = n_1 \exp[-\Delta G^*/kT]. \tag{8}$$

Then neglecting the reverse reaction indicated in (7) we obtain the
expression:

$$I = z s^* n_{i^*}', \tag{9}$$

where $s^*$ is the surface area of the nucleus and $z$ is the collision
frequency of vapour molecules with the surface of the nucleus per area.
From kinetic theory $z = P/\sqrt{2\pi mkT}$, where $m$ is the mass of a
vapour molecule, and substituting (8) into (9) we have:

$$I = [P/(2\pi mkT)^{1/2}]\, s^* n_1 \exp[-\Delta G^*/kT]. \tag{10}$$


BECKER and DÖRING improved this treatment by taking into account
the reverse process indicated in (7), and the decrease in concentration
of embryos during a transition from the equilibrium concentration
because of the rapid growth of nuclei to droplets.

† This definition does not correspond exactly to the operational
definition of $I$ given in the introduction. However, on the basis of
the V.W.B.D. theory the difference in the two $I$ values is negligible
in most instances.

Schematic embryo size distribution curves calculated from these two
points of view are compared in Fig. 2. The expression derived by BECKER
and DÖRING is:

$$I = z n_1 s^* (-b\,\Delta G_v/2\pi i^* kT)^{1/2} \exp(-\Delta G^*/kT). \tag{11}$$

Excepting for the factor $(-b\,\Delta G_v/2\pi i^* kT)^{1/2}$, this
equation for $I$ is identical with equation (10).

Fig. 2. Comparison between VOLMER-WEBER and BECKER-DÖRING
distributions of embryos with respect to size

The basic assumption that the free energy of embryos can be calculated
by assuming them to be small droplets of the macroscopic liquid phase
has recently been examined critically by KIRKWOOD and BUFF and REISS.
The primary criticism derives from the fact that the nucleation rate can
be perceptible only when the nucleus diameter is of the order of 10
molecular diameters or less (equation (6)). When the size is this small
there is some ambiguity in the definition of surface tension. TOLMAN and
KIRKWOOD have developed theories for the variation in the density of
matter across the liquid-vapour interface which predict that the density
changes from the liquid value to the vapour value in a transition zone
(see Fig. 3), of the order of 2-5 molecular diameters in thickness.

Fig. 3. Variation of density of matter across transition zone
between liquid droplet and vapour

As a consequence of this transition zone, these authors show that the
surface energy $\sigma_r$ of a small droplet of radius $r$ should not be
identical with $\sigma_\infty$ for plane surfaces but should be related
to $\sigma_\infty$ by the equation:

$$\sigma_r = \frac{\sigma_\infty}{1 + 2\delta/r_s}, \tag{12}$$

where $\delta$ is the difference between the value of $r$ at the Gibbs
surface of tension and its value ($r = r_s$) where the superficial
density of matter vanishes. According to TOLMAN and KIRKWOOD, $\delta$
is of the order of one to two molecular diameters; hence, when
$\delta/r_s$ is 0.10 to 0.20, $\sigma_r$ is 0.8 to 0.7 $\sigma_\infty$.
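The figures quoted for equation (12) can be reproduced with a short
calculation. This is only a sketch of the arithmetic, taking the
correction in the form $\sigma_r/\sigma_\infty = 1/(1 + 2\delta/r_s)$;
the function name is hypothetical.

```python
def tolman_ratio(delta_over_r):
    """Equation (12): sigma_r / sigma_inf = 1 / (1 + 2*delta/r_s)."""
    return 1.0 / (1.0 + 2.0 * delta_over_r)

for x in (0.10, 0.20):
    print(x, round(tolman_ratio(x), 2))
# prints:
# 0.1 0.83
# 0.2 0.71
```

These values round to the 0.8 and 0.7 quoted in the text.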

KIRKWOOD and BUFF and BUFF, though expressing reservations
about the validity of using macroscopic thermodynamic parameters to
describe the properties of nuclei, have modified equation (5) by taking
into account the variation of $\sigma$ with curvature. BUFF's
development gives, instead of equation (5), the following expression for
$\Delta G^*$:


AG* = —e \ToLy - 26( AG.) | — y- . ‘eee (13) 


° 


where y = a, + (0,7 + 20,dAG,)**. 
The ambiguity in the definition of the energy of interfaces having
radii of curvature of only a few molecular diameters is not confined to
liquid-vapour interfaces but is inherent in the corresponding
definitions for all types of interfaces. This problem must be reckoned
with in the treatments of nucleation of crystalline phases in condensed
systems. The V.W.B.D. theory and its modification tentatively proposed
by KIRKWOOD and BUFF make substantially different predictions of the
value of $\Delta G_v$ at which the rate of nucleation becomes
perceptible. Alternatively, the V.W.B.D. theory predicts that the values
of the interfacial energy, $\sigma$, calculated from rates of nucleation
should be in close agreement with that of $\sigma_\infty$, measured for
flat bulk liquid surfaces, while the KIRKWOOD-BUFF modification predicts
that $\sigma$ should be substantially less than $\sigma_\infty$.


Transient Nucleation—BECKER and DÖRING's final equation for $I$ was
derived on the assumption that $I$ is time independent. However, when
the driving free energy of a phase transition is changed very rapidly
from one level, $(\Delta G_v)_1$, to another, $(\Delta G_v)_2$, there is
a transient during which the embryo distribution characteristic of
$(\Delta G_v)_1$ changes to the distribution characteristic of
$(\Delta G_v)_2$. During this time the nucleation frequency, $I_t$, is
time dependent.

The earliest formal treatment of transient nucleation is due to
ZELDOVICH, who derived an approximate relation $I_t = f(t)$. TURNBULL
pointed out the importance of transients in solid-solid transformations
and evaluated $I_t = f(t)$ precisely by a numerical method for certain
simple cases. More recently KANTROWITZ and PROBSTEIN have developed the
theory for transient nucleation of liquid in supersaturated vapours,
which is of importance in interpreting the results of experiments in
which the supersaturation is increased with great rapidity. With some
simplifying assumptions ZELDOVICH derived from the BECKER-DÖRING theory
the following equation for $I_t$:


$$I_t \approx I \exp[-i^{*2}/4zs^*t] = I \exp[-\tau/t], \tag{14}$$

where $I$ is the steady nucleation frequency. KANTROWITZ takes into
account the possible departure of the accommodation coefficient from
unity and the difference in temperature between the embryos and their
surroundings due to the heat liberated by the condensation. With
these modifications, $zs^*$ in equation (14) is replaced by a function
of the appropriate accommodation coefficients and heat content
quantities. Without making the corrections introduced by KANTROWITZ the
order of time $\tau$ required for the transient in the nucleation of
liquid droplets to expire is a microsecond.
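The approach of $I_t$ to its steady value in equation (14) can be
tabulated directly. The sketch below assumes only the form
$I_t/I = \exp[-\tau/t]$ and takes $\tau$ as one microsecond, the order
of magnitude quoted above; the function name is hypothetical.

```python
import math

def transient_rate_ratio(t, tau):
    """Equation (14): I_t / I = exp(-tau / t), valid for t > 0."""
    return math.exp(-tau / t)

tau = 1.0e-6  # seconds; order of magnitude quoted in the text
for t in (0.1 * tau, tau, 10.0 * tau, 100.0 * tau):
    print(f"t/tau = {t / tau:6.1f}   I_t/I = {transient_rate_ratio(t, tau):.4f}")
```

At $t = \tau$ the frequency has reached $1/e$ of its steady value, and
it is within about one per cent of the steady value by $t = 100\tau$.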


Formal Theory of Heterogeneous Nucleation—It is probable that in
most phase transformations the formation of nuclei is catalyzed by
various foreign surfaces present in the system. In experiments designed
to determine the rate of nucleation, the most difficult problem is the
elimination of these surfaces or the specification of the nature of the
substances effecting nucleation catalysis.

VOLMER started with the basic assumptions of the V.W.B.D. theory
and developed a formal theory for heterogeneous nucleation of liquid
from supersaturated vapour. Suppose that a liquid embryo comes to
equilibrium on a plane foreign surface with a contact angle $\theta$; it
follows that:

$$\sigma_{SV} = \sigma_{LS} + \sigma_{LV}\cos\theta, \tag{15}$$


where $\sigma_{SV}$ is the interfacial energy per area between vapour
and catalyst, $\sigma_{LS}$ is the interfacial energy per area between
liquid and catalyst, and $\sigma_{LV}$ is the interfacial energy per
area between liquid and vapour and is identical with $\sigma$ in the
theory of homogeneous nucleation. The equilibrium form of the embryo is
a spherical cap and the free energy of formation is:

$$\Delta G_s = [\pi r^{2}\sigma_{LV} + (\pi r^{3}/3)\Delta G_v][2 - 3\cos\theta + \cos^{3}\theta], \tag{16}$$

where $r$ is the radius of the sphere. For all $\theta > 0$,
$\Delta G_s$ goes through a maximum having the $\Delta G_s$ co-ordinate

$$\Delta G_s^* = \Delta G^* f(\theta), \tag{17}$$

where $f(\theta) = (2 + \cos\theta)(1 - \cos\theta)^{2}/4$, and
$\Delta G^*$ is given by equation (5). We may suppose that the embryos
are in equilibrium with atoms adsorbed on the catalyst surface from the
condensing vapour. If the number of adsorbed vapour atoms is
$n_s$/area, the equilibrium number of embryos per area of a given size
on the catalyst surface, $n_i^{s}$, is

$$n_i^{s} = n_s \exp[-\Delta G f(\theta)/kT]. \tag{18}$$


Following a development analogous to that in deriving equation (11)
we find approximately for the nucleation frequency per area of catalyst
surface, $I_s$, the following:

$$I_s = z n_s s^* (-b\,\Delta G_v/2\pi i^* kT)^{1/2} \exp[-\Delta G^* f(\theta)/kT]. \tag{19}$$

Consider a unit volume of vapour containing a unit surface of a
catalyst characterized by a single contact angle $\theta$. Since
$n_s \approx 10^{-8} n_1$, it follows from equations (11) and (19) that
for this system,

$$\log_{10}(I_s/I) \approx -8 + (\Delta G^*/2.3kT)[1 - f(\theta)]. \tag{20}$$

$I$ is perceptible when $(\Delta G^*/2.3kT) \approx 25$; therefore,
$I_s$ is of the same magnitude or greater than $I$ for all
$f(\theta) \leq 2/3$, corresponding to $\theta \leq 102°$. When
$\theta = 0$, $I_s$ can be perceptible with an infinitesimal degree of
supersaturation.
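The contact-angle dependence in equations (17) and (20) is easily
tabulated. The following sketch assumes the values used in the text,
$\log_{10}(n_s/n_1) = -8$ and $\Delta G^*/2.3kT = 25$, as defaults; the
function names are hypothetical.

```python
import math

def f_theta(theta_deg):
    """Shape factor of equation (17): (2 + cos t)(1 - cos t)^2 / 4."""
    c = math.cos(math.radians(theta_deg))
    return (2.0 + c) * (1.0 - c) ** 2 / 4.0

def log10_rate_ratio(theta_deg, barrier_over_2p3kT=25.0, log10_ns_over_n1=-8.0):
    """Equation (20): log10(I_s/I) for a given contact angle in degrees."""
    return log10_ns_over_n1 + barrier_over_2p3kT * (1.0 - f_theta(theta_deg))

for theta in (0, 30, 60, 90, 102, 120, 180):
    print(theta, round(f_theta(theta), 3), round(log10_rate_ratio(theta), 2))
```

The factor $f(\theta)$ rises from 0 at $\theta = 0$ through 1/2 at 90°
to 1 at 180°, and the ratio $\log_{10}(I_s/I)$ changes sign a little
above 100°, in line with the limit quoted in the text.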


Comparison of Theory and Experiment—Nucleation of liquid phases is
catalyzed by a large variety of surfaces, and in most vapour-liquid
transitions the supersaturation is relieved by diffusion of vapour to
liquid heterogeneously nucleated on these surfaces. In order to obtain
sufficient supersaturation to cause a perceptible rate of homogeneous
nucleation the vapour is freed, as far as possible, from suspended
particles and it is rapidly chilled by adiabatic expansion so that the
supersaturation ratio $P/P_\infty$† builds up rapidly. With these
conditions the value of $P/P_\infty$ required for perceptible
homogeneous nucleation can be reached before an appreciable amount of
vapour has diffused to surfaces on which liquid has been heterogeneously
nucleated.

In experiments on nucleation of liquid droplets in supersaturated
vapours $P/P_\infty$ is progressively increased in a series of different
adiabatic expansions until a fog of liquid droplets is seen to form in
the body of the vapour. The significant datum obtained from the
experiment is the minimum value, $(P/P_\infty)_{\mathrm{crit}}$,
necessary to cause this fog formation. Experience indicates that the fog
appears in these experiments at a fairly sharply defined and
reproducible supersaturation ratio. This result is in agreement with the
prediction of the V.W.B.D. theory that $I$ increases sharply with
increasing $P/P_\infty$, e.g. an increase of 2 per cent in $P/P_\infty$
may cause a tenfold increase in $I$. Because of this sharp dependence of
the rate of nucleation on the supersaturation ratio, a large error in
assigning a value to $I$ corresponding to $(P/P_\infty)_{\mathrm{crit}}$
has little effect on the value of $\sigma$ calculated from the
nucleation data, i.e. the exponential term dominates equation (11).
Therefore, in the VOLMER-FLOOD type of experiment,
$(P/P_\infty)_{\mathrm{crit}}$ is measured accurately and $I$ is
estimated within wide limits of uncertainty, within a factor of 100.

† $P$ is the vapour pressure and $P_\infty$ is the vapour pressure in
equilibrium with a flat surface of the corresponding liquid.
VOLMER and FLOOD have measured $(P/P_\infty)_{\mathrm{crit}}$ for
various temperatures for water and some organic compounds. For an ideal
vapour,

$$\Delta G_v = -(RT/V)\ln(P/P_\infty), \tag{21}$$

where $V$ is the volume per mole of the liquid phase. VOLMER and FLOOD
compared their measured values of $\ln(P/P_\infty)_{\mathrm{crit}}$ with
values calculated from equation (11), where $\Delta G^*$ is given by
(5), $\Delta G_v$ by (21), and assuming that $\sigma$ is equal to
$\sigma_\infty$.

Alternatively, $\sigma$ may be calculated from (11), by substitution of
the measured value of $(P/P_\infty)_{\mathrm{crit}}$, and compared with
$\sigma_\infty$. The comparison of $\sigma$ with $\sigma_\infty$, the
measured liquid-vapour interfacial energy for infinite radius of
curvature, is shown in Table 1.
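The sharp dependence of $I$ on the supersaturation ratio can be
illustrated by evaluating equation (10) with $\Delta G_v$ taken from
equation (21) and the spherical-embryo results (5) and (6). The sketch
below is not a reproduction of the VOLMER-FLOOD analysis: the property
values are rounded, assumed figures for water near 275 K, the
BECKER-DÖRING factor of equation (11) is omitted, and the function name
is hypothetical.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.02214076e23   # Avogadro constant, 1/mol
R = K_B * N_A         # gas constant, J/(mol K)

def nucleation_rate(S, T, sigma, V_molar, P_inf, molar_mass):
    """Equation (10) in nuclei per m^3 per second; S is P/P_inf."""
    kT = K_B * T
    P = S * P_inf
    m = molar_mass / N_A                                   # mass per molecule, kg
    dG_v = -(R * T / V_molar) * math.log(S)                # equation (21)
    dG_star = 16.0 * math.pi * sigma**3 / (3.0 * dG_v**2)  # equation (5)
    r_star = -2.0 * sigma / dG_v                           # equation (6)
    s_star = 4.0 * math.pi * r_star**2                     # nucleus surface area
    n_1 = P / kT                                           # ideal-gas molecules per m^3
    z = P / math.sqrt(2.0 * math.pi * m * kT)              # collisions per area per second
    return z * s_star * n_1 * math.exp(-dG_star / kT)

# Assumed, rounded inputs for water near 275 K (illustration only).
water = dict(T=275.0, sigma=0.075, V_molar=1.8e-5, P_inf=700.0, molar_mass=0.018)
S = 4.2
ratio = nucleation_rate(1.02 * S, **water) / nucleation_rate(S, **water)
print(f"I(1.02 S) / I(S) = {ratio:.1f}")
```

With these assumed inputs a 2 per cent increase in the supersaturation
ratio multiplies $I$ several-fold; the exact factor depends on the
temperature and on the value of $\sigma$ adopted.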

TABLE 1

Comparison of measured liquid-vapour interfacial energies with values
calculated from VOLMER-FLOOD experiments.
After TURNBULL and HOLLOMON

Substance             Temperature (K)    $\sigma$ calculated from      $\sigma$ measured
                                         VOLMER-FLOOD data
Water
Methyl Alcohol
Ethyl Alcohol
n-Propyl Alcohol
iso-Propyl Alcohol
n-Butyl Alcohol
Nitromethane


With the single exception of the result for methyl alcohol, where the
deviation is 50 per cent, $\sigma$ and $\sigma_\infty$ agree within 2-3
per cent and well within the experimental uncertainty. From these
results it is concluded that the V.W.B.D. theory describes the frequency
of homogeneous nucleation of liquid droplets in pure vapours to within
the limits of accuracy attained in these experiments. The diameter of
the nucleus calculated from the VOLMER-FLOOD data is of the order of 5
molecular diameters and certainly of the same order of magnitude as the
$\delta$ of the TOLMAN and KIRKWOOD theories. The data are in
disagreement with the prediction of BUFF's modified theory that
$\sigma \approx 0.8\sigma_\infty$ for the experimental conditions of
VOLMER and FLOOD.

SANDER and DAMKÖHLER have measured the rate of nucleation of
water droplets in supersaturated water vapour at lower temperatures
than those attained by VOLMER and FLOOD. Their calculated values of
$\sigma$ are about 10 per cent less than $\sigma_\infty$ at the lowest
temperature of their measurements. Further, SANDER and DAMKÖHLER's
$\sigma$ appears to go through a maximum with decreasing temperature in
qualitative agreement with BUFF's theory. However, in the temperature
range of overlapping measurements SANDER and DAMKÖHLER's $\sigma$ is
about 7 per cent less than VOLMER and FLOOD's. Recently POUND and
MADONNA have repeated the experiments on nucleation of water, including
in their measurements the temperature range covered by VOLMER and FLOOD
and SANDER and DAMKÖHLER. POUND and MADONNA's values of $\sigma$ are in
good agreement with those of VOLMER and FLOOD and with $\sigma_\infty$
down to a nucleus containing about 70 molecules. For $i^* < 70$,
$\sigma < \sigma_\infty$, and at $i^* \approx 50$,
$\sigma \approx 0.9\sigma_\infty$.

WEGENER and SMELT found that the degree of supersaturation
reached in the very rapid expansion of moist air is much greater than
would seem to be permitted by the V.W.B.D. theory. KANTROWITZ
suggested that the supersaturation in WEGENER and SMELT's experiments
reaches a very high level because of the time lag in reaching the
steady state value of $I$ predicted by the V.W.B.D. theory. KANTROWITZ
showed that these results are precisely described by the approximate
theory of transient nucleation with a single disposable parameter,
the accommodation coefficient, $\alpha$, of water vapour molecules on a
liquid water surface. However, the value of x ~ 5 « 10~ required for 
the agreement seems too low. 

It appears, therefore, that the kinetics of nucleation of liquid droplets
in supersaturated vapours is adequately described by the V.W.B.D.
theory and it seems permissible to ignore the effect of droplet radius on
its surface energy. The success of the V.W.B.D. theory encourages us
to explore the consequences of the use of analogous assumptions in
extending the theory and in describing the nucleation of crystalline
phases.


Theory of Nucleation in Condensed Systems

Most of the theories of homogeneous nucleation in condensed systems
start with the basic assumptions of the V.W.B.D. theory. Thus the
free energy for formation of embryos in condensed systems is assumed
to be given by an equation identical in form with equation (2) with the
appropriate $\sigma$ and $\Delta G_v$ values and values of $a$ and $b$
consistent with an assumed shape of the nucleus. $\Delta G^*$ is given by

$$\Delta G^* = K\sigma^{3}/(\Delta G_v)^{2}, \tag{22}$$

where $K$ is determined by the nucleus shape and is $16\pi/3$ for a
spherical nucleus.

For nucleation in condensed systems the kinetic theory expression, $z$,
for the rate of bombardment of the nucleus by molecules of the parent
phase can, in general, no longer be used. The movement of an atom
across an interface between two condensed phases usually requires a
free energy of activation, $\Delta G_A$. On this basis BECKER derived a
general expression for the rate of nucleation in condensed systems.

Recently TURNBULL and FISHER, again following the basic assumptions
of the V.W.B.D. theory, applied the absolute reaction rate formalism
to nucleation in condensed systems and derived a more quantitative
expression for $I$. In terms of the absolute rate theory the frequency,
$\nu$, for transfer of an atom across an interface is:

$$\nu = (kT/h)\exp[-\Delta G_A/kT], \tag{23}$$

where $h$ is Planck's constant. Following the methods of BECKER and
DÖRING they derived an expression for the rate of nucleation:

$$I = K_v \exp[-(\Delta G^* + \Delta G_A)/kT], \tag{24}$$

where $K_v = n^*(a\sigma/9\pi kT)^{1/2}\, n\,(kT/h)$, $a$ (see equation
(2)) is determined by the nucleus shape, $n^*$ is the number of atoms in
the surface of a nucleus of critical size, and $n$ is the number of
atoms per volume of the parent phase.

For spherically shaped nuclei,

$$(a\sigma/9\pi kT)^{1/2} = (\sigma/kT)^{1/2}(2v/9\pi)^{1/3}, \tag{25}$$

where $v$ is volume per atom of the parent phase.

Following the treatment in the preceding section we may write for
the transient rate of nucleation in condensed systems:

$$I_t \approx I \exp[-i^{*2}/4n^*\nu t], \tag{26}$$

where $\nu$ is given by equation (23). Since $\nu$ can be very small the
transient period, $\tau = i^{*2}/4n^*\nu$, can be much longer for
condensed systems than for the nucleation of liquid from vapour.
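The sensitivity of the transient period to the activation free energy
follows from equations (23) and (26). The sketch below evaluates
$\tau = i^{*2}/4n^*\nu$ for assumed values of $i^*$, $n^*$ and
$\Delta G_A$ chosen only for illustration; none of these numbers comes
from the text, and the function names are hypothetical.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J s
EV = 1.602176634e-19 # joules per electron-volt

def jump_frequency(T, dG_A):
    """Equation (23): nu = (kT/h) exp(-dG_A/kT), dG_A in joules per atom."""
    return (K_B * T / H) * math.exp(-dG_A / (K_B * T))

def transient_period(i_star, n_star, T, dG_A):
    """tau = i*^2 / (4 n* nu), the time constant in equation (26)."""
    return i_star**2 / (4.0 * n_star * jump_frequency(T, dG_A))

# Assumed illustrative nucleus: 200 atoms, 100 of them in the surface.
for dG_A_eV in (0.0, 1.0, 2.0):
    tau = transient_period(200, 100, 500.0, dG_A_eV * EV)
    print(f"dG_A = {dG_A_eV:.1f} eV   tau = {tau:.3e} s")
```

Each additional electron-volt of activation free energy at 500 K
lengthens $\tau$ by roughly ten orders of magnitude in this example,
which is why transients that are negligible in vapours may dominate in
condensed systems.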

The temperature coefficient of $I$, $dI/dT$, is found, by
differentiation of equation (24), to be:

$$dI/dT = I\{(\Delta G^* + \Delta G_A)/kT^{2} - (1/kT)(d[\Delta G^*]/dT)\}, \tag{27}$$

where $\Delta G_A$ is assumed to be independent of $T$. $I$ goes through
a maximum $I = I_{max}$ at $T = T_{max}$; $T_{max}$ is found by solving
the equation

$$(\Delta G^*_{T=T_{max}} + \Delta G_A)/T_{max} = [d(\Delta G^*)/dT]_{T=T_{max}}, \tag{28}$$

obtained by setting $dI/dT = 0$. From the thermodynamic identity,
$(\partial G/\partial T)_P = -S$, it follows that

$$[d(\Delta G_v)/dT]_P = -\Delta S_v, \tag{29}$$

where $\Delta S_v$ is the entropy change per volume accompanying the
phase transformation. Often it can be assumed to a fair approximation
that $\Delta S_v$ is temperature independent, whence it follows:

$$\Delta G_v = \Delta S_v (T_0 - T), \tag{30}$$

where $T_0$ is the temperature at which the two phases of infinite mass
are in equilibrium.

Substitution of equations (22) and (30) into (28) gives a general
relation:

$$(T_{max} - T_0)^{3} + \frac{K\sigma^{3}}{\Delta G_A(\Delta S_v)^{2}}(T_{max} - T_0) + \frac{2K\sigma^{3}T_{max}}{\Delta G_A(\Delta S_v)^{2}} = 0, \tag{31}$$

that can be solved for $T_{max}$. When $\Delta G_A = 0$,
$T_{max} = T_0/3$. If $\Delta G_A > 0$, $T_{max} > T_0/3$.

The formal treatment of VOLMER for nucleation of liquid from vapour
on catalytic surfaces has been extended to the nucleation of crystalline
phases in condensed systems. Consider, for example, the nucleation
of a crystal in a supercooled liquid. If it is assumed that the
liquid-crystal interfacial energy $\sigma_{LC}$ is isotropic, equations
(15) and (16) apply with the subscripts SL, SC and LC replacing SV, LS
and LV, respectively, where S refers to catalyst, L to liquid and C to
the forming crystal having the same composition as the liquid. By a
simple extension of the treatment of TURNBULL and FISHER the following
expression for $I_s$ for crystalline phases in condensed systems is
obtained:

$$I_s = K_s \exp[-\Delta G^* f(\theta)/kT], \tag{32}$$

where for nuclei shaped like spherical caps,

$$K_s = n^*(\sigma/kT)^{1/2}(2v/9\pi)^{1/3}[f(\theta)]^{1/6}\, n\,(kT/h)\exp[-\Delta G_A/kT], \tag{33}$$

and $\sigma$ is the interfacial energy per area between forming crystal
and the parent phase.
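Equation (31) has no convenient closed solution for $\Delta G_A > 0$,
but it is easily solved numerically. The following sketch multiplies
(31) through by $\Delta G_A$ and brackets the root by bisection between
$T_0/3$ and $T_0$; it assumes, as in the derivation above, that $\sigma$
and $\Delta S_v$ are independent of temperature. The symbol `C` stands
for $K\sigma^{3}/(\Delta S_v)^{2}$, and the function name and inputs are
hypothetical.

```python
def t_max(T0, C, dG_A, tol=1e-9):
    """Root of equation (31) in [T0/3, T0] by bisection.

    C = K*sigma^3/(dS_v)^2 (units J K^2) and dG_A >= 0 (units J) must be
    expressed in mutually consistent units.
    """
    def g(T):
        # Equation (31) multiplied by dG_A; g increases monotonically with T.
        return dG_A * (T - T0) ** 3 + C * (3.0 * T - T0)

    lo, hi = T0 / 3.0, T0
    while hi - lo > tol * T0:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)

# Assumed illustrative inputs, not values from the text.
print(t_max(1000.0, 2.6e-14, 0.0))      # limit dG_A = 0
print(t_max(1000.0, 2.6e-14, 1.0e-19))  # finite activation free energy
```

In the limit $\Delta G_A = 0$ the routine returns $T_0/3$, and any
positive $\Delta G_A$ moves the maximum to a higher temperature.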

As has been pointed out earlier, transients are as likely to be
important in heterogeneous as in homogeneous nucleation. With $I_s$
substituted for $I$ and using the appropriate values of $i^*$ and $n^*$,
equation (26) gives an approximate relation for the transient in
heterogeneous nucleation.

Let a phase $\alpha$ be in equilibrium with a phase $\beta$ having the
same composition at temperature $T_0$. Below $T_0$, $\beta$ is more
stable and above $T_0$, $\alpha$ is more stable. Suppose $\beta$ is
heated to a temperature in the range of $\alpha$ stability. It follows
that at this temperature $\beta$ is less stable than $\alpha$ on all
plane foreign surfaces for which $\theta > 0$, where $\theta$ is the
contact angle between $\beta$ in contact with $\alpha$ and the foreign
surface.

However, VOLMER has shown that material held in microcavities
of suitable geometry in the foreign surface can be more stable as
$\beta$ than as $\alpha$, even in the temperature range of $\alpha$
stability, for $0 < \theta < 90°$. The basic reason is that for
$\theta < 90°$ there is a net gain in surface energy for that part of
the $\beta$ surface that forms on the surface of a catalyst (see
equation (15)). Therefore, any geometry of the $\beta$ form that
maximizes the fraction $a_S$ of the $\beta$ surface forming on the
catalyst surface and minimizes the fraction $a_\alpha$ in contact with
the parent $\alpha$ will favour the stabilization of $\beta$. Evidently
a negative curvature in the catalyst surface increases $a_S/a_\alpha$.

A theory for the thermal history phenomenon in phase transformations,
based on the concept of retention of $\beta$ in microcavities, has been
derived. As the temperature is increased above $T_0$, the size of the
microcavities in which $\beta$ is retained is decreased and at very high
temperatures becomes of the order of molecular dimensions. After an
element of material in a microcavity transforms from $\beta$ to $\alpha$
it cannot revert to $\beta$ until the temperature is lowered
sufficiently for the rate of nucleation of $\beta$ on a flat surface to
be appreciable.

Elements of $\beta$ retained in microcavities nucleate the
transformation from $\alpha$ to $\beta$. In order to serve as a nucleus
the radius of the $\beta$ element, and hence the microcavity in which it
is retained, must equal or exceed the critical radius $r_s^*$ for
nucleation of $\beta$ on a flat surface. Since $r_s^*$ decreases with
decreasing temperature it follows that the supercooling, $\Delta T_-$,
will be less the larger is the maximum size, $r_m$, of the microcavity
in which $\beta$ is retained. Since $r_m$ decreases with increasing
$\Delta T_+$, where $\Delta T_+ = T_m - T_0$ and $T_m$ is the maximum
temperature attained in the range of $\alpha$ stability, it follows that
$\Delta T_-$ should increase with $\Delta T_+$. On the basis of these
concepts and assuming cylindrical microcavities the undercooling should
be related to the degree of superheating by the following equation:

$$\tan\theta = \Delta T_-/\Delta T_+. \tag{34}$$

Fig. 4. Effect of superheating on the supercooling required to effect
nucleation of embryos retained in cylindrical cavities

Fig. 4 shows $\Delta T_-$ as a function of $\Delta T_+$ for a
particular value of $\theta$.

It is evident that equation (34) ceases to have meaning when
$\Delta T_-$ exceeds the supercooling $\Delta T_-^*$ required for a
perceptible rate of nucleation on a flat element of the catalyst
surface. $\Delta T_-$ is a linear function of $\Delta T_+$ for
$\Delta T_- < \Delta T_-^*$ but is independent of $\Delta T_+$ once
$\Delta T_-$ has reached $\Delta T_-^*$.
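The behaviour described for Fig. 4 amounts to a straight line through
the origin that is cut off at the flat-surface value. A minimal sketch,
assuming equation (34) holds up to the limit $\Delta T_-^*$ and that
$0 < \theta < 90°$; the function name and the numbers are hypothetical.

```python
import math

def undercooling(dT_plus, theta_deg, dT_minus_star):
    """Equation (34), dT_- = dT_+ * tan(theta), capped at dT_minus_star."""
    return min(math.tan(math.radians(theta_deg)) * dT_plus, dT_minus_star)

# Assumed illustrative case: theta = 45 degrees, flat-surface limit of 30 degrees.
for dT_plus in (5.0, 10.0, 20.0, 40.0, 80.0):
    print(dT_plus, undercooling(dT_plus, 45.0, 30.0))
```

For small superheating the undercooling grows in proportion to
$\Delta T_+$; beyond the cut-off further superheating has no effect.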

With suitable microcavities in its surface a substance characterized
by a comparatively large value of $\theta$ may be as effective in
nucleation catalysis as another substance characterized by a very small
value of $\theta$. For example, it is doubtful that substances, such as
gallium, having very complex crystal structures ever crystallize from a
liquid without very marked undercooling unless nucleation is effected by
retained nuclei.

In heterogeneously nucleated transformations it is possible for the
preferred nucleation sites to be exhausted. AVRAMI (now MELVIN)
recognized this possibility and developed a formal theory to describe it.

Suppose there are $N_0$ preferred nucleation sites per volume in the
system. These sites might be occupied by retained nuclei or by catalyst
particles so small that the whole surface of a particular particle will
be engulfed by growth from the first $\beta$ nucleus that forms on its
surface. If there is equal probability that a nucleus will form at any
of the preferred sites the number $N$ per untransformed volume that have
not nucleated in time, $t$, is:

$$N = N_0 \exp[-ft], \tag{35}$$

where $f$ is the nucleation frequency/site.

The nucleation frequency/volume is found by differentiation of (35)
to be:

$$I_v = N_0 f \exp(-ft). \tag{36}$$

Fig. 5. Frequency of heterogeneous nucleation as a function of time
showing initial build up toward steady frequency and final falling off
because of exhaustion of nucleation sites

AVRAMI did not consider the possibility of an initial transient and
his theory requires that $I_v$ always decrease with time. When the
initial transient in $I_v$ is taken into account a more general relation:

$$I_v = N_0 f \exp[-(ft + i^{*2}/4n^*\nu t)], \tag{37}$$

in heterogeneous nucleation is obtained that is indicated graphically
in Fig. 5. $I_v$ increases from a value of zero to a steady value and
then declines as the nucleation sites are exhausted. It is evident that
the entire system may be transformed to $\beta$ by growth of transient
$\beta$ nuclei before the effects of exhaustion have become apparent.
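The rise and fall of $I_v$ in equation (37) can be traced numerically.
Writing $\tau = i^{*2}/4n^*\nu$, the exponent $-(ft + \tau/t)$ is
greatest where $f = \tau/t^{2}$, so the frequency peaks at
$t = (\tau/f)^{1/2}$. The sketch below assumes arbitrary illustrative
values of $N_0$, $f$ and $\tau$; the function names are hypothetical.

```python
import math

def site_limited_rate(t, N0, f, tau):
    """Equation (37): I_v(t) = N0 f exp[-(f t + tau/t)], with tau = i*^2/(4 n* nu)."""
    if t <= 0.0:
        return 0.0
    return N0 * f * math.exp(-(f * t + tau / t))

def time_of_peak(f, tau):
    """Setting d/dt (f t + tau/t) = 0 gives t = sqrt(tau / f)."""
    return math.sqrt(tau / f)

# Assumed illustrative inputs: 1e12 sites per m^3, f = 0.01 per second, tau = 1 s.
N0, f, tau = 1.0e12, 1.0e-2, 1.0
t_peak = time_of_peak(f, tau)
for t in (0.1 * t_peak, 0.5 * t_peak, t_peak, 2.0 * t_peak, 10.0 * t_peak):
    print(f"t = {t:8.2f} s   I_v = {site_limited_rate(t, N0, f, tau):.3e}")
```

When $\tau f \ll 1$ the peak value is close to $N_0 f$, the steady
frequency of equation (36) before exhaustion sets in.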









Nucleation Phenomena in Condensed Systems 


Fracture of Liquids—The breaking of liquids subjected to hydrostatic
tension takes place by the nucleation and growth of bubbles. Consider
the formation of a spherical cavity in the body of a liquid subjected to
a hydrostatic tensile stress, $P$ (negative pressure). Nucleation of
these cavities can be described by the theory of nucleation in condensed
systems. If $P$ exceeds $P_v$, the vapour pressure of the liquid,
$\Delta G_v = -P$ and $\Delta G^* = 16\pi\sigma^{3}/3P^{2}$, where
$\sigma$ is the liquid-vapour interfacial energy. Following FISHER the
frequency of bubble nucleation is:

$$I = n(kT/h)\exp[-\Delta G_A/kT]\exp[-16\pi\sigma^{3}/3P^{2}kT], \tag{38}$$

where $n$ is the number of atoms per volume and $\Delta G_A$, the free
energy of moving an atom out of the liquid-vapour interface into the
liquid, may be set equal to the free energy per atom for viscous flow.
This formal treatment applies only to relatively fluid liquids. FISHER
has shown that in very viscous liquids such as glasses, fracture by the
nucleation and growth of thin disc-like cavities is a much more probable
occurrence than fracture by the nucleation and growth of spherical
cavities.
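Equation (38) can be inverted to estimate the tension at which cavity
nucleation becomes perceptible. The sketch below takes the second
exponent as $\Delta G^*/kT$ with $\Delta G^* = 16\pi\sigma^{3}/3P^{2}$,
neglects $\Delta G_A$ (an assumption made here for simplicity), and uses
rounded, assumed property values for water at 300 K with a rate of one
nucleus per mole of liquid per second. The function name is
hypothetical.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
H = 6.62607015e-34   # Planck constant, J s
ATM = 101325.0       # pascals per atmosphere

def fracture_pressure(I, n, T, sigma, dG_A=0.0):
    """Tension P (Pa) at which equation (38) gives the rate I (per m^3 per s)."""
    kT = K_B * T
    log_term = math.log(n * kT / (H * I)) - dG_A / kT
    return math.sqrt(16.0 * math.pi * sigma**3 / (3.0 * kT * log_term))

# Assumed, rounded inputs for water at 300 K (illustration only).
n_water = 3.34e28            # molecules per m^3
I_target = 1.0 / 1.8e-5      # one nucleus per mole per second, per m^3
P = fracture_pressure(I_target, n_water, 300.0, 0.072)
print(f"P = {P / ATM:.0f} atm")
```

With these inputs the estimate comes out at somewhat over a thousand
atmospheres. Because $P$ depends only on the square root of the
logarithm of $I$, the result is insensitive to the rate chosen as
"perceptible".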

Liquids usually prefer to pull apart at interfaces already present in 
the system rather than by homogeneous nucleation of cavities within 
the bulk of the liquid. Practically, it is very difficult to attain the 
negative pressure necessary to cause homogeneous nucleation of 
cavities before the pressure is relieved by the growth of a bubble that is 
nucleated heterogeneously at an existing interface. In order to mini- 
mize the likelihood of heterogeneous nucleation of cavities, foreign 
particles should be removed from the liquid and a container having walls 
that are strongly “wet” by the liquid should be selected. 


TABLE 2

Comparison of Theoretical and Measured Fracture Stresses of Various
Liquids. After FISHER and BERNATH

Liquid              Fracture Pressure (Atmospheres)
Water
Ethyl Alcohol
Ethyl Ether
Benzene
Acetic Acid

The criterion for the fracture of a liquid subjected to a negative
pressure is that the rate of nucleation of cavities ($I$) must have some
perceptible value (such as 1/mole-second). In order to test the
predictions of nucleation theory, negative pressures,
$P_{\mathrm{calc}}$, corresponding to $I = 1$ mole$^{-1}$ sec$^{-1}$
calculated from equation (38) are compared with values measured
experimentally, $P_{\mathrm{exp}}$. In this comparison only the maximum
negative pressures that have been reproduced experimentally are
pertinent since it is presumed that the pressure required for
homogeneous nucleation must be equal to or greater than the maximum
negative pressure that is observed. The comparison of
$P_{\mathrm{calc}}$ and $P_{\mathrm{exp}}$ for various fluids is shown
in Table 2.

Although the maximum negative pressure is nearly as large as the 
calculated pressure for acetic acid, in all other cases it is less by a factor 
of 2 to 6. It is probable that cavities were nucleated heterogeneously 
in all the latter experiments. 

The heterogeneous nucleation of cavities takes place according to the 
general theory already described. 


Crystallization of Supercooled Liquids—The supercooling of liquids was
observed almost as soon as temperature began to be measured
quantitatively. FAHRENHEIT (around the year 1725) probably made the
first systematic study of supercooling. He discovered that water readily
supercooled several degrees before beginning to crystallize and that
different samples of water consistently behaved differently.
Fortunately, from the standpoint of using the ice-water equilibrium
temperature as a fixed point in thermometry, ice crystals grow very
rapidly so that, though different samples of water start to crystallize
at widely different temperatures, the rapid release of heat associated
with growing ice crystals quickly raises their temperature to very
nearly the equilibrium temperature.

According to VOLMER's historical account it was realized more
than one hundred years ago that the crystallization of supercooled
liquids or supersaturated solutions is usually nucleated heterogeneously
by foreign bodies. These bodies are usually present accidentally and in
too small an amount to permit chemical identification.

The evidence indicating that crystal nucleation in supercooled
liquids is usually induced by accidental impurities has been summarized
in many earlier publications. Perhaps the most important evidence is the
specificity of the behaviour of different samples of the same liquid,
first observed by FAHRENHEIT, and the reduced nucleation frequency
observed after a liquid has been passed through a fine filter.

The problem of minimizing the effect of adventitious nucleation
catalysts in crystallization has been a baffling one and an apparently
workable solution was not found until recently.

For more than seventy years it has been known that small metal
droplets supercool several tens of degrees before solidification. The
most systematic early study of the phenomenon was made by INGERSOLL
and MENDENHALL. They found that 50 to 100 micron diameter
droplets of metals such as gold, platinum and rhodium often
supercooled over 100° when melted and cooled on a Nernst glower. There
seemed to be a rough correlation between the maximum supercoolings,
$\Delta T$, and the melting temperature of the metal. For platinum the
amount of supercooling was observed to be approximately 370°C. They
found that $\Delta T$ was generally greater the smaller the droplet.
Their results, coupled with the common knowledge that large continuous
masses of liquids, such as gold, that crystallize to close-packed
structures rarely supercool more than 2 or 3°, indicate a tremendous
effect of specimen size on solidification behaviour.

Whether INGERSOLL and MENDENHALL or any of the earlier investigators
appreciated the large magnitude and importance of this size
effect is not entirely clear from the literature.


SCHAEFER*® on water again focused attention on the size effect in solidi- 
fication. VONNEGUT studied dilatometrically the rate of solidification 
of a dispersion of small tin droplets, 1-10 microns in diameter, prevented 
from coalescing by an oxide film. In these experiments the period of 
crystal growth was negligible in comparison with that of nucleation. 
The rate of solidification was not perceptible until the dispersion had 
been supercooled of the order of 105 to 110°. Large continuous masses 
of tin have not been supercooled more than 30°C without crystallization.” 

CWILONG and SCHAEFER observed that snow crystals do not form 
at an appreciable rate in a fog of water droplets (10-20 microns in 
diameter) until a "critical" temperature 39 to 40° below the normal 
melting point of water is reached. Large continuous masses of water 
have only rarely been supercooled more than 20° without crystallization. 

These experiments clearly confirm the existence of a marked size 
effect in solidification. Thus, the frequency of nucleus formation ($k$) 
in a given specimen is a function of the volume, $v$, of the specimen 
and the degree of supercooling, i.e. $k = f(v, \Delta T)$. We may for a 
given liquid define an "isokinetic" relation as follows: 

\[ v = f(\Delta T), \qquad k = \text{constant} \tag{39} \]


With tin the magnitude of the size effect is such that the value of $v$ 
changes by a factor of the order of $10^{11}$ for a change of 80° in 
$\Delta T$. 

It has been demonstrated that only the theory that partitioning the 
liquid isolates the effective catalyst particles in a small fraction of 
the droplets seems compatible with all the facts. Suppose that the 
liquid containing $m_p$ adventitious nucleation catalyst particles per 
volume is dispersed into $m$ droplets per volume kept from 
intercommunicating by suitable barriers. If $m \gg m_p$, crystallization 
nuclei in most droplets either must form homogeneously or on the surface 
of added nucleation catalysts, e.g. a crystalline film on the droplet 
surface. The probability that a particular droplet contains an 
accidental nucleation catalyst is $\sim m_p/m$. 
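
The estimate of the fraction of droplets that still hold a catalyst can be illustrated with a short sketch. It assumes, as an idealization not stated in the text, that the catalyst particles are scattered independently through the liquid (Poisson statistics); the function name `fraction_with_catalyst` is hypothetical.

```python
import math


def fraction_with_catalyst(m_p: float, m: float) -> float:
    """Fraction of droplets that contain at least one catalyst particle.

    m_p: catalyst particles per unit volume of the undivided liquid
    m:   droplets per unit volume after dispersion
    Assumption: particles are distributed independently (Poisson), so the
    mean number per droplet is m_p / m.
    """
    mean_per_droplet = m_p / m
    # 1 - exp(-mean), written with expm1 to stay accurate for small means
    return -math.expm1(-mean_per_droplet)


if __name__ == "__main__":
    for ratio in (1e-3, 1e-1, 1.0, 10.0):
        frac = fraction_with_catalyst(ratio, 1.0)
        print(f"m_p/m = {ratio:g}: fraction with catalyst = {frac:.4g}")
```

For $m \gg m_p$ the result reduces to approximately $m_p/m$, the estimate used in the text; near $m_p/m \sim 1$ a substantial share of the droplets holds no catalyst while others hold several.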

The isolation theory can be tested by comparing the isokinetic 
relation it predicts with the isokinetic relation measured experimentally. 
Consider first the very simple case where all $m_p$ catalyst particles 
have the same catalytic potency per area characterized by a single value 
of the contact angle, $\theta$. The isokinetic relation for droplets 
containing the average number of catalyst particles per volume is the 
curve cd drawn schematically in Fig. 6. Curve ad represents the 
isokinetic relation for homogeneous nucleation. For large values of $v$ 
($v \gg 1/m_p$), $m_p \gg m$ and the measured isokinetic relation will 
follow curve cd. In the region $v \sim 1/m_p$, the crystallization 
behaviour will be very erratic because there is an average of only one 
catalyst particle per droplet. Since these particles will be distributed 
statistically some droplets will not contain any, and others will 
contain several times the average number of catalyst particles. 
Therefore, the measured $\Delta T$ will fall on the curve ad for some 
droplets and above or below the curve cd for others. For $1/m_p \gg v$, 
the average measured $\Delta T$ will follow the curve ad. We have made 
an idealized representation (see solid curve Fig. 6) of the predicted 
experimental isokinetic relation. Note that, in agreement with 
nucleation theory, the absolute magnitude of the slope of ad is much 
greater than that of cd. 

Fig. 6. Schematic isokinetic relation for solidification of liquid 
droplets that may contain only one variety of nucleation catalyst 

The experimental evidence definitely indicates that there is rarely, 
if ever, so large a discontinuity in the $\Delta T$-log $v$ relation as 
is indicated in Fig. 6. Rather the data suggest an almost continuous 
increase of $\Delta T$ with decreasing log $v$ as shown in Fig. 7. 

To account for this almost continuous variation of $\Delta T$ with 
log $v$ two additional postulates are necessary. First, the $m_p$ 
catalyst particles are characterized by a variety of $\theta$ values 
rather than a single one. That is, some catalysts will be more effective 
than others in promoting solidification. Second, the concentration of 
the more effective nucleation catalysts is likely to be smaller than 
that of catalysts that are less effective. From these postulates LEVINE 
has derived a $\Delta T$-log $v$ relation that successfully describes 
the experimental data of DORSCH and HACKER on the freezing of water 
droplets. LEVINE's theory is formally identical with WEIBULL's theory 
(see also FISHER and HOLLOMON) for the size effect in fracture. The 
basic postulates, due to GRIFFITH, underlying WEIBULL's theory are that 
fracture is initiated by structural defects and that the density of 
these defects per area is greater the lesser their ability to initiate 
fracture. These postulates are analogous with the underlying postulates 
of the isolation theory for the size effect in solidification. 

Fig. 7. General isokinetic relation for a liquid containing a wide 
variety of accidental nucleation catalysts 

It is reasonable to suppose that a liquid may contain a wide variety 
of nucleation catalysts. This variety may be due to chemical differences 
and/or to differences in the catalytic potency of substances due to 
heterogeneous surface structure. 

The concentration of catalyst particles will decrease as their effective- 
ness increases only if a few substances can be potent nucleation catalysts 
for a given liquid-crystal transition but a wide variety of substances 
can be catalytic to some degree. There is experimental evidence that 
favours this postulate. 

Probably the most quantitative support for the isolation theory and 
for the theories of homogeneous and heterogeneous nucleation applied 
to solidification are the results of TURNBULL on the kinetics of 
solidification of dispersions of mercury droplets coated with various 
films. In these experiments the temperature range of rapid 
solidification was first measured in continuous cooling experiments. 
This temperature range is relatively insensitive to cooling rate because 
of the large temperature coefficient of the rate of nucleation. Crystal 
nucleation frequencies were deduced from isothermal rates of 
solidification measured for several of the dispersions. From the 
isothermal data and the measured droplet size distribution it was 
determined whether the nucleation frequency was proportional to the 
volume or to the surface area of the droplet. 

Fig. 8. Isokinetic relation for solidification of mercury droplets 
coated with various films. After TURNBULL 

Fig. 8 shows isokinetic relations for the various dispersions corres- 
ponding to a nucleation frequency 10~* sec~'. The solid lines for droplets 
coated with mercury laurate (HgLau), mercury acetate (HgAc), and 
mercurous iodide (Hg₂I₂) films are precise. Approximate relations for 
droplets coated with mercury sulphide (HgS), mercury stearate 
(HgSt), and HgX were deduced from continuous cooling experiments 
and some isothermal measurements of uncertain interpretation on 
HgSt and HgX dispersions. Chemical evidence suggested that the HgX 
film was a mercury oxide. Also drawn in the figure is the assumed 
course of the isokinetic relation for the bulk mercury sample 
containing only the accidental catalysts. There is a large uncertainty 
in this curve, since it was inferred from only 3 points and the two 
lowest of them give only the upper limit of $\Delta T$ for the 
corresponding $v$ value. 

These results confirm two of the most important predictions of the 
isolation theory. First, there is no single isokinetic relation that 
describes the size effect in solidification; rather the isokinetic 
behaviour is clearly dependent upon the nature of the film on the 
droplet surface. Second, the slope of the isokinetic relation for a 
particular catalyst is very small. For example, the isokinetic relation 
for droplets coated with HgLau gives $\Delta T = 70°$ when extrapolated 
to $v = 1$ cc. Since 1 cc masses of mercury almost always start to 
crystallize for $\Delta T < 4°$, it is concluded that they must contain 
relatively effective catalyst particles that become isolated from most 
of the mass when it is dispersed. 

It was established that the nucleation frequency in droplets coated 
with HgLau is proportional to droplet volume and that the kinetic 
coefficient, $K_v$, that describes the data is in reasonable agreement 
with $K_v$ calculated from equation (24) predicted by the theory of 
homogeneous nucleation. Comparison of the experimental and calculated 
$K_v$ is indicated in Table 3. These results are compatible with the 
interpretation that nuclei formed homogeneously in the HgLau coated 
droplets. The temperature coefficient of the nucleation frequency gives 
the value $\sigma = 31.2$ ergs/cm² (see table) when it is assumed that 
the nuclei are spherical. 

The disagreement (a factor of $10^7$) between the experimental and 
calculated values of $K_v$ is outside the experimental error. A possible 
reason for this disagreement is that $\sigma$ and $\Delta S_v$ may not 
be independent of temperature as assumed in making the calculation. It 
was shown that the disagreement can be explained on the assumption of 
small and apparently reasonable magnitudes for the temperature 
coefficients of $\sigma$ and $\Delta S_v$. It is also possible to 
account for the disagreement on the concept that a crystal nucleus may 
derive from a structure that is somewhat different from the most stable 
macroscopic structure, characterized by the normal melting temperature, 
$T_m$. This concept is plausible because the structure that minimizes 
$\sigma$ is not necessarily the same structure that minimizes the Gibbs 
free energy of the crystal. For example, let the nucleus be strained an 
amount $\varepsilon$ relative to the stable macroscopic structure 
$\alpha$. Then $\sigma$ is a function of $\varepsilon$, 
$\sigma(\varepsilon)$, and equation (1) for the free energy of forming 
the crystal nucleus is modified to: 

\[ \Delta G = 4\pi r^2 \sigma(\varepsilon) + \tfrac{4}{3}\pi r^3 \left(\Delta G_v + c\varepsilon^2\right) \tag{40} \]

where $c$ is the appropriate elastic coefficient. The macroscopic 
structure, $\beta$, from which the nucleus derives will be characterized 
by a melting temperature, $T_m'$, which is lower than $T_m$ and is found 
by solving the equation $\Delta G_v = \Delta S_v \Delta T = -c\varepsilon^2$. 
The free energy of forming the nucleus is: 

\[ \Delta G^* = f(r^*, \varepsilon^*) \tag{41} \]

where $r^*$ and $\varepsilon^*$ are found by solving simultaneously the 
equations: 

\[ \left[\partial(\Delta G)/\partial r\right]_{\varepsilon} = 0 \tag{42} \]

\[ \left[\partial(\Delta G)/\partial \varepsilon\right]_{r} = 0 \tag{43} \]

Let the Gibbs free energy for formation of $\beta$ be $\Delta G_v'$; it 
then follows that 

\[ \Delta G_v' = \Delta S_v (T_m' - T) = \Delta S_v \Delta T' \tag{44} \]

$\sigma(\varepsilon)$ is not known; however, we may calculate, from the 
measured temperature coefficient of $I$ and substituting (44) into (24), 
the value of $T_m'$ that will bring the experimental and theoretical 
values of $K_v$ into agreement. The value of $T_m'$ thus calculated is 
about 11-12°C below $T_m$. Displacements of the order of 1-2 per cent in 
the lattice parameter of $\alpha$ give a structure that should melt 
11-12° below $T_m$. 

The nucleation frequency, $k$, of droplets coated with HgAc was 
proportional to the area rather than to the volume of the droplet. 
This result is in good agreement with the notion that the formation of 
nuclei in these droplets was catalyzed by the surface film. The 
temperature dependence of $I_s$ is satisfactorily described by equation 
(32) with the values of $\theta$ and $K_s$ given in Table 3. The 
experimental value of $K_s$ is in excellent agreement with the value 
predicted by the theory so $T_m'$ is displaced from $T_m$ by a 
negligible amount. 


TABLE 3 

Summary of Data on Isothermal Solidification of Small Droplets 

Substance    Film         Log K values (Theor., Exptl.) 
Mercury      HgLau        42.1    35.1 
Mercury      HgAc 
Mercury      HgSt 
Mercury      Hg₂I₂ 
Tin          Tin oxide 


Other results of the investigation of mercury solidification are 
summarized in the table. Solidification isotherms for droplets coated 
with Hg₂I₂ were described on the basis that nucleation was catalyzed 
by particles, characterized by a single $\theta$, suspended in the 
mercury and that either the mercury nucleus derives from a structure 
melting about 30° below $T_m$ or that the rate controlling step is not 
nucleation of mercury crystals but the nucleation of a phase, possibly 
of Hg₂I₂, that catalyzes the nucleation of mercury. 

Results for solidification of the HgSt and HgX coated droplets 
could not be described by a single value of $I$ or $I_s$ at constant 
temperature but could be described by a multiplicity (2 for HgSt; a 
large number for HgX) of $I_s$ values. POUND and LA MER's isotherms for 
the solidification of oxide-coated tin droplets can be described by a 
similar multiplicity in $I_s$. Despite this multiplicity in $I_s$ the 
estimated $K_s$ values for HgSt-coated mercury and oxide-coated tin 
droplets are in excellent agreement with the theoretical values as 
indicated in the table. As a corollary $T_m'$ is displaced from $T_m$ by 
a very small amount in comparison with the supercooling. Because of the 
great multiplicity in $I_s$ it was not possible to estimate $K_s$ for 
HgX-coated droplets. 

In summary, the data on the solidification of supercooled mercury 
and tin droplets are compatible, to well within the experimental uncer- 
tainty, with the theory of homogeneous and heterogeneous nucleation 
in condensed systems. 

A valuable feature of the theory of homogeneous nucleation in 
condensed systems is that $K_v$ is calculable from the characteristic 
constants for the system. It is thus possible on the basis of the theory 
to calculate the unknown parameter $\sigma$ from an $I$ value for a 
single temperature. We have confidence in the validity of this procedure 
because of the fair agreement between the theoretical and experimental 
$K_v$ for mercury. TURNBULL and CECH have used a microscopic technique 
to study the solidification behaviour of small droplets (10-50 micron 
diameter) of a variety of metals. The maximum supercooling 
$\Delta T_m$ before solidification that they obtained for the various 
metals is given in Table 4. Also listed in the table are $\Delta T_m$ 
values for other substances obtained by other investigators using a 
variety of techniques.* It is supposed that the magnitude of the 
supercooling $\Delta T'$ corresponding to a perceptible rate of 
homogeneous nucleation is equal to or greater than $\Delta T_m$. 
Assuming $\Delta T' = \Delta T_m$, reasonable values of $I$ were assumed 
and $\sigma$ for the various substances was calculated from equation 
(24). These calculations have been brought up to date and the resulting 
$\sigma$ values are listed in Table 4. 
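
The procedure of deducing $\sigma$ from a single supercooling can be sketched numerically. Equation (24) is not reproduced in this section, so the sketch below assumes the usual classical form for spherical nuclei, $I = K_v \exp[-16\pi\sigma^3/(3\,\Delta G_v^2\,kT)]$ with $\Delta G_v \approx \Delta H_v\,\Delta T/T_m$; the prefactor $10^{35}$, the perceptible rate of 1 cm⁻³ sec⁻¹, and the mercury inputs are illustrative assumptions (approximate handbook figures), not values taken from the text.

```python
import math

K_B = 1.380649e-16  # Boltzmann constant, erg/K


def sigma_from_supercooling(t_m, delta_t, heat_fusion_per_cm3,
                            log10_kv=35.0, rate=1.0):
    """Interfacial energy (erg/cm^2) that gives nucleation frequency `rate`
    (cm^-3 s^-1) at supercooling `delta_t` (deg), for spherical nuclei.

    Assumed model (stand-in for equation (24)):
        I = K_v * exp(-16*pi*sigma**3 / (3 * dG_v**2 * k * T))
        dG_v ~ heat_fusion_per_cm3 * delta_t / t_m
    """
    temperature = t_m - delta_t
    dg_v = heat_fusion_per_cm3 * delta_t / t_m          # erg/cm^3 (magnitude)
    barrier_over_kt = math.log(10.0) * log10_kv - math.log(rate)
    sigma_cubed = (barrier_over_kt * 3.0 * dg_v ** 2 * K_B * temperature
                   / (16.0 * math.pi))
    return sigma_cubed ** (1.0 / 3.0)


if __name__ == "__main__":
    # Illustrative mercury inputs: T_m ~ 234.3 K, dT = 77 deg,
    # heat of fusion ~ 1.63e9 erg/cm^3 (assumed, approximate).
    sigma_hg = sigma_from_supercooling(234.3, 77.0, 1.63e9)
    print(f"sigma ~ {sigma_hg:.1f} erg/cm^2")
```

Because $\sigma$ enters as a cube, the result is insensitive to the assumed rate and prefactor: changing either by several powers of ten shifts $\sigma$ by only a few per cent, which is why a single supercooling suffices for an estimate.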

* Recently there has been some dispute on the correct $\Delta T_m$ for 
water. SCHAEFER and CWILONG agree that rapid formation of snow crystals 
begins when a supercooled fog reaches a temperature of -39°. However, 
CWILONG, SANDER and DAMKÖHLER, and POUND and MADONNA all report that a 
fog formed at -50 to -62°C by rapid expansion of water vapour may 
consist of supercooled water droplets. Their conclusion that the 
droplets were liquid was based on their failure to observe reflections 
from crystal facets that are expected from snow crystals. SCHAEFER has 
shown, however, that a dispersion of small ice spheres can exist without 
any perceptible reflections. We have tentatively accepted SCHAEFER's 
value of $\Delta T_m$. 


TABLE 4 

Summary of Liquid-crystal Interfacial Energies Calculated 
from Supercooling Data 

Substance    Reference    ΔT_m    σ (ergs/cm²)    σ_g/T_m (cal/deg) 
Mercury      28           77      28.1*           1.52 
Gallium      51           76      6.0             1.91 
Tin          4            118     4.0             1.59 
Bismuth      28           90      34.4            1.52 
Lead         28           80      33.3            0.80 
Antimony 
Aluminium 
Germanium 
Silver 
Gold 
Copper 
Manganese 
Nickel 
Cobalt       50           330     234.0           1.02 
Iron 
Palladium 
Platinum 
Water 
Benzene 

* Calculated on basis outlined in reference 43. 

Recently FALKENHAGEN and HOFMANN have, using very large cooling rates, 
supercooled aluminium about 195°. This value of $\Delta T_m$ gives 
$\sigma_g/\Delta H_f \approx 0.48$, which is in better agreement with 
the other values of $\sigma_g/\Delta H_f$ for face-centred cubic metals. 


The gram-atomic interfacial energy, $\sigma_g$, between liquid and 
crystal phases is defined as the free energy of a crystal-liquid 
interface containing one Avogadro number $N$ of atoms. Assuming that the 
interfacial energy is localized in a layer one atom thick it follows: 

\[ \sigma_g = N^{1/3} V^{2/3} \sigma \tag{45} \]

where $V$ is the gram-atomic volume of the crystalline phase. TURNBULL 
showed that $\sigma_g$, calculated from nucleation rates, is roughly 
proportional to the gram-atomic heat of fusion $\Delta H_f$. 
$\sigma_g/\Delta H_f$ (see Table 4) ranges from 0.32 to 0.61. $\sigma_g$ 
for the various substances is plotted against $\Delta H_f$ in Fig. 9. 
Points for most of the metals fall fairly well on a common line of slope 
0.46. 
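
As an illustration of how the gram-atomic interfacial energy follows from a measured interfacial energy, the sketch below evaluates $\sigma_g = N^{1/3}V^{2/3}\sigma$ and the ratio to the heat of fusion. The function names and the numerical inputs are hypothetical and serve only to show the unit handling (ergs to calories).

```python
AVOGADRO = 6.02214076e23   # atoms per gram atom
ERG_PER_CAL = 4.184e7      # ergs in one calorie


def gram_atomic_interfacial_energy(sigma, gram_atomic_volume):
    """sigma in erg/cm^2, gram_atomic_volume in cm^3 -> cal per gram atom.

    A monolayer containing N atoms, each occupying (V/N)**(2/3) of area,
    covers N**(1/3) * V**(2/3) square centimetres.
    """
    monolayer_area = AVOGADRO ** (1.0 / 3.0) * gram_atomic_volume ** (2.0 / 3.0)
    return sigma * monolayer_area / ERG_PER_CAL


def ratio_to_heat_of_fusion(sigma, gram_atomic_volume, heat_of_fusion_cal):
    """Dimensionless ratio sigma_g / dH_f (heat of fusion in cal per gram atom)."""
    return gram_atomic_interfacial_energy(sigma, gram_atomic_volume) / heat_of_fusion_cal


if __name__ == "__main__":
    # Hypothetical inputs: sigma = 30 erg/cm^2, V = 10 cm^3, dH_f = 600 cal.
    sigma_g = gram_atomic_interfacial_energy(30.0, 10.0)
    print(f"sigma_g = {sigma_g:.0f} cal per gram atom")
    print(f"sigma_g / dH_f = {ratio_to_heat_of_fusion(30.0, 10.0, 600.0):.2f}")
```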

Fig. 9. Gram-atomic interfacial energy, $\sigma_g$, as a function of the 
heat of fusion, $\Delta H_f$. Revision of TURNBULL 

Large liquid masses of metals that crystallize to close-packed 
structures such as gold, copper or lead rarely supercool more than 5° 
before beginning to crystallize. For example, DANILOV and NEUMARK, after 
subjecting lead to what they believed to be a satisfactory purification 
procedure, found that it started to crystallize at 3° supercooling. On 
the other hand, large liquid masses of metals that crystallize to more 
complex non-close-packed structures, such as tin, bismuth or gallium, 
often can be easily supercooled 25° or more before starting to crystallize. 
These results led to the concept advanced by DANILOV and NEUMARK 
and others that the supercooling corresponding to a perceptible 
nucleation rate, $\Delta T_m$, is strongly dependent on the structure of 
the forming crystal and for liquids crystallizing to close-packed 
structures $\Delta T_m < 5°$. DANILOV and NEUMARK, for example, believed 
that $\Delta T_m \approx 3°$ for lead. This interpretation was 
considered plausible on the basis of the widely held view that the 
arrangement of atoms in most liquid metals approximates a close-packed 
structure. Thus the transition liquid (semi-close packed) → crystal 
(close packed) could presumably take place with a minimum of atomic 
displacements. These views of the liquid → crystal (close packed) 
transformation apparently were developed in ignorance of the results of 
INGERSOLL and MENDENHALL and other earlier investigations of the 
supercooling of gold, etc. Certainly these results coupled with results 
summarized in Table 4 establish that the values of $\Delta T_m/T_m$ are 
not widely different for metals whether or not they crystallize to a 
close-packed structure. Also BARDENHEUER and BLECKMANN have been able, 
under certain conditions, to supercool 150 g masses of liquid iron 255° 
before the beginning of crystallization. 

Although the nature of the forming structure has not a marked effect 
upon $\Delta T_m/T_m$ in solidification there is a rough correlation 
between the magnitude of the ratio $\sigma_g/T_m$ (see Table 4) and the 
structure complexity. This ratio ranges from a value of 1.9 for gallium 
to about 1.0 for metals that crystallize to close-packed structures. 

A substantial proportion of accidental catalyst particles are likely 
to be cubic or hexagonal in structure and may, therefore, be capable to a 
greater or lesser degree of catalyzing the formation of cubic or hexagonal 
crystals. On the other hand, very few substances have a structure 
very similar to that of gallium and, therefore, capable of effectively 
catalyzing the formation of gallium crystals. In a random selection of 
catalyst particles that might be present accidentally in a liquid metal 
it is probable that a large proportion would have cubic or hexagonal 
structures but few, if any, would have a structure similar to that of 
gallium. Therefore, a liquid that crystallizes to a complex structure 
will probably supercool much more relative to its melting point than a 
liquid that crystallizes to a cubic or hexagonal structure. 

Although WEBSTER and others found a pronounced thermal history 
effect in the solidification of large masses of bismuth, no measurable 
thermal history dependence of supercooling in the solidification of small 
bismuth droplets has been found. These results show clearly that the 
thermal history effect depends for its existence upon the presence of 
crystal nucleation catalyst particles in the liquid. 


Crystal → Liquid Transitions. The frequency of homogeneous nucleation 
of liquid droplets in the body of a crystal should be given by an 
equation identical in form with equation (24). This theory predicts that 
the homogeneous formation of liquid nuclei should not become perceptible 
until the superheating becomes almost as great as the supercooling 
required for a perceptible rate of homogeneous crystal nucleation in the 
liquid. 

Experience indicates that crystals almost invariably begin to melt 
at temperatures not perceptibly different from the equilibrium 
temperature. Also it is known that most crystals are completely wet by 
liquid phase of the same composition at the melting temperature, 
hence: 

\[ \sigma_{SV} > \sigma_{SL} + \sigma_{LV} \tag{46} \]

where the subscripts S, L and V denote the solid, liquid and vapour 
phases. Therefore, there can be no net increase in surface energy when a 
liquid forms on a free solid surface of the same composition and 
$\Delta G^*$ for forming liquid nuclei on this surface is zero. For this 
reason crystals will not superheat when they have any part of their 
surface exposed to a gas phase. 

Thus it is only possible to superheat solids by preventing in some 
way the exposure to vapour of the solid surface at the equilibrium 
temperature. KHAIKIN and BENET have succeeded in superheating 
the interior of a tin single crystal 1 to 2°. In their experiment a large 
electric current was passed through the crystal. The energy was carried 
away from the free surface of the crystal so that a temperature 
difference of 1-2° existed between the crystal axis and the surface. 
Although polycrystals and defective single crystals invariably started 
to melt internally, sound single crystals always started to melt from 
the free surface, indicating that the material at the crystal axis was 
superheated 1-2°. This amount of superheating is only a very small 
fraction of what is possible theoretically (~ 100° for tin). 

An experimental approach to the problem that has not yet been 
successfully worked out is to completely coat small crystals with a sub- 
stance that strongly catalyzes their formation in supercooled liquids. 
For example, it should be possible to substantially superheat tiny ice 
crystals covered with a coherent coating of silver iodide. 


STRUCTURE AND COMPOSITION 


Among the common types of transformation involving a change in 
structure and composition are: (1) condensation of liquid from a 
gaseous solution; (2) the reverse process of formation of gas bubbles in 
liquid solution; (3) precipitation of crystalline phases from liquid 
solutions; (4) precipitation of crystalline phases from solid solutions. 
We shall ignore the first two types and treat the last two, which are of 
most interest in metal systems. 

In the nucleation of crystals from solid or liquid solutions the 
important variables are structure, the strains due to the disregistry of 
the structures, composition and temperature. In our first approxima- 
tion we will neglect effects due to strain. 


Thermodynamics of Precipitation 


The Gibbs free energy, $G$, of a solution is a function of composition 
$x$ (mole fraction of one component), temperature $T$ and pressure $P$. 
Fig. 10 (a) is a schematic representation of $G$ as a function of $x$ 
(mole fraction of B) at constant $T$ and $P$. Here, $G$ is a continuous 
function of $x$ and applies to a series of solid solutions of one 
crystal symmetry (e.g. face-centred cubic) or liquid solutions. Fig. 11 
shows two $G = f(x)$ relations, each characteristic of phases $\alpha$ 
and $\beta$ (either two solid solutions or a liquid and a solid 
solution). The most convenient criterion of equilibrium in these systems 
is that $G$ be a minimum at constant $T$ and $P$. Applying this 
criterion to the system represented in Fig. 10 (a) it follows that the 
composition of the solution ranges continuously for $x < x_a$ and for 
$x > x_b$. For all $x_a < x < x_b$, the free energy of the system is 
minimized by the coexistence of two phases of composition $x_a$ and 
$x_b$ [a and b are points of tangency of the line ab to $G = f(x)$]. 
Similarly it is found that the free energy of the system represented in 
Fig. 11 is a minimum for $x_a < x < x_b$ when two phases of composition 
$x_a$ and $x_b$ coexist. 

Fig. 10. (a) Free energy of a solid solution as a function of composition. 
(b) Temperature-composition equilibrium diagram for two components 
partially miscible in the solid state 

In the conventional representation of phase equilibrium the phase 
coexistence relations are projected on a $T$-$x$ plane for constant $P$ 
as shown in Fig. 10 (b). Also plotted as dotted lines (hd and he) in 
Fig. 10 (b) are the spinodal relations. The points on these curves 
fulfil the condition $[\partial^2 G/\partial x^2]_{T,P} = 0$. In areas 
hda and heb $[\partial^2 G/\partial x^2]_{T,P} > 0$ and in area hde 
$[\partial^2 G/\partial x^2]_{T,P} < 0$. 
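
The tangency and spinodal conditions can be made concrete with a model free energy. The sketch below assumes a symmetric regular solution, $G = \Omega x(1-x) + RT[x\ln x + (1-x)\ln(1-x)]$, which is not the model of the text but reproduces the features of Fig. 10; `omega` (the interaction parameter) and the function names are hypothetical. For this symmetric form the common tangent is horizontal, so the coexistence compositions satisfy $\partial G/\partial x = 0$.

```python
import math

R = 8.314462618  # gas constant, J/(mol K)


def spinodal(omega, temperature):
    """Compositions where d2G/dx2 = 0, i.e. x(1 - x) = RT / (2 * omega).

    Returns None above the critical temperature omega / (2R).
    """
    disc = 1.0 - 2.0 * R * temperature / omega
    if disc <= 0.0:
        return None
    root = math.sqrt(disc)
    return 0.5 * (1.0 - root), 0.5 * (1.0 + root)


def binodal(omega, temperature):
    """Coexistence compositions (points of common tangency) by bisection.

    Assumes the temperature is high enough that dG/dx < 0 at x = 1e-12.
    """
    limits = spinodal(omega, temperature)
    if limits is None:
        return None

    def slope(x):
        # dG/dx for the symmetric regular solution
        return omega * (1.0 - 2.0 * x) + R * temperature * math.log(x / (1.0 - x))

    lo, hi = 1e-12, limits[0]   # slope < 0 at lo, slope > 0 at the spinodal
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if slope(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    x_a = 0.5 * (lo + hi)
    return x_a, 1.0 - x_a


if __name__ == "__main__":
    omega, temperature = 20000.0, 800.0   # illustrative values, J/mol and K
    print("binodal :", binodal(omega, temperature))
    print("spinodal:", spinodal(omega, temperature))
```

The spinodal compositions always lie inside the coexistence compositions, so that between them the solution is metastable rather than unstable, which is the region in which nucleation is required.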

Now consider fluctuations in composition in metastable solid solutions 
for $x_a < x < x_b$. A cluster of atoms formed by a fluctuation may, for 
a given mean composition of the supersaturated solution $x$, be 
characterized by the number of atoms, $i$, that it contains and 
$\Delta x$, where $\Delta x = x' - x$, and $x'$ = the composition of the 
cluster. It is assumed, following BECKER and BORELIUS, that the 
composition of the matrix immediately surrounding the cluster is $x$. In 
our discussion we shall consider only fluctuations favourable to the 
nucleation of $\beta$. 

Fig. 11. Free energy-composition dependence at constant temperature of 
two solid solution structures, $\alpha$ and $\beta$ 

Fig. 12. Free energy of a cluster as a function of the fluctuation in 
composition at constant size. After BORELIUS 

Let $\Delta G$ be the free energy increase per gram atom, neglecting 
interfacial energy, for the formation of a cluster. It follows from 
thermodynamic principles that 

\[ \Delta G = G' - \left(G + \Delta x\, \partial G/\partial x\right) \tag{47} \]

where $G'$ is the free energy per gram atom of solution at mole fraction 
$x'$, $G$ is the free energy per gram atom of solution at mole fraction 
$x$, and $\partial G/\partial x$ is the slope of the $G = f(x)$ relation 
at $x$. $\Delta G$ is the difference in $G$ co-ordinates at a given $x'$ 
of $G = f(x)$ and the tangent to $G = f(x)$ at point $x$. For the matrix 
composition corresponding to point c, $\Delta G$ is proportional to the 
length of the line df at $x' = x_s'$ and to minus the length of the line 
bg at $x' = x_b$. 

Fig. 12 shows $\Delta G = f(\Delta x)$ for a given matrix composition 
$x$. The function goes through a maximum at the spinodal composition 
$x_s'$. For $x' > x_s'$ at constant $i$ the free energy of a cluster is 
decreased by fluctuations increasing $\Delta x$ but for $x' < x_s'$ the 
reverse is true. 

Let $\Delta G_b$ be the free energy per gram atom for forming clusters 
with $x' = x_b$, neglecting surface energy again. $\Delta G_b$ as a 
function of matrix composition $x$ is represented schematically in 
Fig. 13. $\Delta G_b = f(x)$ goes through a minimum at the spinodal 
composition $x = x_s'$ and a maximum at the spinodal composition 
$x = x_s''$. 


Nucleation of a New Phase 


$\Delta G_i$, the increase in free energy due to the formation of a 
cluster containing $i$ atoms, is a function of $T$, $x$ and $\Delta x$. 
For a given value of $T$ and $x$ we may write 

\[ \Delta G_i = a\sigma i^{2/3} + \Delta G\,(i/N) \tag{48} \]

where $\sigma$, the interfacial energy per area between the cluster and 
matrix, is a function of $x$ and $\Delta x$, $a$ is a shape factor, 
$\Delta G$ is given by equation (47), and $N$ is Avogadro's number. The 
number, $n_i$, of clusters per volume in equilibrium with the matrix is 

\[ n_i = n_0 \exp\left[-\Delta G_i/kT\right] \tag{49} \]

where $n_0$ is a slowly varying function of $\Delta x$. 

Fig. 13. Free energy per gram atom of a cluster having the equilibrium 
composition of the precipitate from a solid solution 

For a given value of $x$, $\Delta G_i$ can be plotted as a function of 
$i$ and $\Delta x$. In order to become a nucleus for the growth of phase 
of composition $x_b$, a cluster must have attained a critical size and 
composition (critical cluster) such that further increase of either $i$ 
or $\Delta x$ will decrease $\Delta G_i$. From the surface 
$\Delta G_i = f(i, \Delta x)$ the minimum, $\Delta G_i = \Delta G_i^*$, 
for the formation of a critical cluster can be found. 

The location of the saddle point can be found formally from the 
following equation: 

\[ \Delta G_i = a i^{2/3} f(\Delta x) + (i/N) F(\Delta x) \tag{50} \]

obtained by substituting $\sigma = f(\Delta x)$ and 
$\Delta G = F(\Delta x)$ in equation (48), where $f(\Delta x)$ and 
$F(\Delta x)$ are functions of temperature and matrix composition. By 
setting $[\partial(\Delta G_i)/\partial i]_{\Delta x} = 0$, we find the 
height of the surface ridge $\Delta G_i^0$ to be: 

\[ \Delta G_i^0 = A\,[f(\Delta x)]^3/[F(\Delta x)]^2 \tag{51} \]

where $A = 4a^3N^2/27$. The value of $\Delta x = \Delta x^*$ at the 
saddle point is found by solving the equation 

\[ 3F(\Delta x) f'(\Delta x) - 2f(\Delta x) F'(\Delta x) = 0 \tag{52} \]

obtained by equating $d(\Delta G_i^0)/d(\Delta x)$ to zero. Substitution 
of $\Delta x^*$ into (51) gives $\Delta G_i^*$. The practical solution 
for the saddle point position from equations (51) and (52) is 
complicated by our lack of knowledge of the interfacial energy and its 
dependence upon composition. A simpler solution is obtained if, 
following BECKER, it is assumed that the position of the saddle point is 
always at $x_b$. Then $\Delta G = \Delta G_b$ and $\sigma$ is a function 
of $x$ only, $\sigma = f(x)$, so that 

\[ \Delta G_i^* = A\sigma^3/(\Delta G_b)^2 \tag{53} \]

By a nearest neighbour approximation BECKER estimates 
$\sigma = (E/v^{2/3})(x - x_b)^2$, where $v$ = volume per atom of 
precipitate, $E = E_{AB} - \tfrac{1}{2}(E_{AA} + E_{BB})$ and $E_{AA}$, 
etc., are the pairwise interactions of the atoms A and B making up the 
solution. HOBSTETTER allowed the composition of the nucleus to vary and 
derived a formal expression for the rate of nucleation. 
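
The ridge height at fixed composition can be checked numerically. The sketch below takes the cluster free energy in the form $a f\, i^{2/3} + (i/N)F$ with $F < 0$, compares a brute-force maximum over $i$ with the closed form $4a^3N^2f^3/(27F^2)$, and reports the critical size $i^* = (-2afN/3F)^3$ that follows from setting the derivative with respect to $i$ to zero. The function names and the dimensionless inputs are hypothetical.

```python
def cluster_free_energy(i, a, f, F, N):
    """Surface term a*f*i**(2/3) plus volume term (i/N)*F, with F < 0."""
    return a * f * i ** (2.0 / 3.0) + (i / N) * F


def ridge_height(a, f, F, N):
    """Closed-form maximum over i: 4 a^3 N^2 f^3 / (27 F^2)."""
    return 4.0 * a ** 3 * N ** 2 * f ** 3 / (27.0 * F ** 2)


def critical_size(a, f, F, N):
    """Cluster size at the maximum, from d(dG_i)/di = 0."""
    return (-2.0 * a * f * N / (3.0 * F)) ** 3


if __name__ == "__main__":
    a, f, F, N = 1.0, 1.0, -1.0, 30.0     # illustrative, dimensionless
    brute = max(cluster_free_energy(i, a, f, F, N) for i in range(1, 20001))
    print("brute-force maximum:", brute)
    print("closed form        :", ridge_height(a, f, F, N))
    print("critical size      :", critical_size(a, f, F, N))
```

The cubic dependence on the interfacial term and the inverse-square dependence on the driving free energy are what make the nucleation barrier so sensitive to supersaturation.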

From the theory of nucleation in condensed systems it follows that 
the steady state frequency of homogeneous nucleation of precipitate in a 
supersaturated solution is (see equation (24)): 

\[ I = (x - x_b)\, n^* \left(\frac{a\sigma}{9\pi kT}\right)^{1/2} n\, \frac{kT}{h} \exp\left[-\frac{\Delta G_i^* + \Delta G_D}{kT}\right] \tag{54} \]

where $\Delta G_i^*$ is given by equation (51) or (53), $\Delta G_D$ is 
the free energy for diffusion of component B in the solid solution and 
the other terms have the same significance as in equation (24). 

We have assumed that $\sigma$ is isotropic or that the critical 
clusters are bounded by crystallographically similar planes. It is easy 
to refine the treatment to take into account the nucleation of clusters 
exhibiting simple departures from spherical or cubical forms. For 
example, suppose that, due to the anisotropy of the interfacial energy, 
the critical clusters are disc-like. Let the radius of the disc be $r$ 
and its thickness $t$, then: 

\[ \Delta G_i = 2\pi r t \sigma_e + 2\pi r^2 \sigma_f + (\pi r^2 t/V)\,\Delta G_b \tag{55} \]

where $V$ is the volume per gram atom of precipitate and $\sigma_e$ and 
$\sigma_f$ are the interfacial energies per area of the disc edge and 
face respectively. It follows that the radius $r^*$ and thickness $t^*$ 
of the critical cluster are: 

\[ r^* = -2\sigma_e V/\Delta G_b \tag{56a} \]

\[ t^* = -4\sigma_f V/\Delta G_b \tag{56b} \]

whence it follows: 

\[ \Delta G_i^* = 8\pi \sigma_e^2 \sigma_f V^2/(\Delta G_b)^2 \tag{57} \]
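
The saddle point for a disc-like cluster can be verified numerically. The sketch below writes the free energy as $2\pi r t\sigma_e + 2\pi r^2\sigma_f + \pi r^2 t\,g$, where $g = \Delta G_b/V$ is the (negative) driving free energy per volume, evaluates it at $r^* = -2\sigma_e/g$ and $t^* = -4\sigma_f/g$, and checks by finite differences that both partial derivatives vanish there. Under these definitions the barrier evaluates to $8\pi\sigma_e^2\sigma_f/g^2$. Function names and inputs are hypothetical.

```python
import math


def disc_free_energy(r, t, sigma_e, sigma_f, g):
    """Edge term + two-face term + volume term; g = dG_b / V is negative."""
    return (2.0 * math.pi * r * t * sigma_e
            + 2.0 * math.pi * r ** 2 * sigma_f
            + math.pi * r ** 2 * t * g)


def disc_saddle(sigma_e, sigma_f, g):
    """Critical radius, thickness and barrier height of the disc."""
    r_star = -2.0 * sigma_e / g
    t_star = -4.0 * sigma_f / g
    barrier = disc_free_energy(r_star, t_star, sigma_e, sigma_f, g)
    return r_star, t_star, barrier


def gradient(r, t, sigma_e, sigma_f, g, h=1e-6):
    """Central-difference partial derivatives with respect to r and t."""
    d_r = (disc_free_energy(r + h, t, sigma_e, sigma_f, g)
           - disc_free_energy(r - h, t, sigma_e, sigma_f, g)) / (2.0 * h)
    d_t = (disc_free_energy(r, t + h, sigma_e, sigma_f, g)
           - disc_free_energy(r, t - h, sigma_e, sigma_f, g)) / (2.0 * h)
    return d_r, d_t


if __name__ == "__main__":
    sigma_e, sigma_f, g = 2.0, 1.0, -1.0      # illustrative, dimensionless
    r_star, t_star, barrier = disc_saddle(sigma_e, sigma_f, g)
    print("r*, t*     :", r_star, t_star)
    print("barrier    :", barrier)
    print("closed form:", 8.0 * math.pi * sigma_e ** 2 * sigma_f / g ** 2)
    print("gradient   :", gradient(r_star, t_star, sigma_e, sigma_f, g))
```

The aspect ratio at the saddle point, $t^*/2r^* = \sigma_f/\sigma_e$, shows that a low face energy relative to the edge energy gives a thin disc.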


In discussing precipitation from solid solution, KONOBEYEVSKI and 
BORELIUS have ignored the interfacial energy between the cluster and 
the matrix and set $\Delta G_i = (i/N)\Delta G$. Even though not 
explicitly stated, these theories, if valid, must apply whether $\alpha$ 
and $\beta$ are liquid or solid phases. Recently this theory has been 
further developed and advanced by BORELIUS. According to BORELIUS' 
theory, $\Delta G_i^*$ must lie along the ridge of the 
$\Delta G_i = f(i, \Delta x)$ surface corresponding to the spinodal 
composition $x_s'$. However, the height of this ridge is a minimum when 
$i = 1$. A literal application of the KONOBEYEVSKI-BORELIUS hypothesis, 
therefore, leads to the prediction that $\Delta G_i^*$ is virtually zero 
for any $T$ and $x$. In order to obtain a partial description of the 
experimental data BORELIUS finds it necessary to postulate that the 
critical cluster crosses the ridge at 
$\Delta G_i^* = (i^*/N)(\Delta G)_s$, where $i^*$ is of the order of 100 
to 200 atoms. It is presumed that a fluctuation must include 100 to 200 
atoms in order to be significant but there is no theory, based on the 
KONOBEYEVSKI-BORELIUS postulate, that permits any prediction of the 
magnitude of $i^*$. 

BORELIUS has analyzed certain data on rates of precipitation and
shown that there appear to be certain singularities associated with
precipitation from supersaturated solutions initially having the spinodal
composition. When $\log\tau$, where $\tau$ is the time required for half-completion
of precipitation, is plotted against $1/T$ the curves for various
values of $x$, the initial matrix composition, are parallel at relatively low
temperatures. That is, the slopes of the curves are a function of $T$
only. It is found that the temperature $T'$ at which the slope becomes
dependent upon $x$ as well as $T$ corresponds to the temperature at which
$x = x_s'$, the spinodal composition. BORELIUS writes for $\tau$,

$$\tau = f_1(\Delta G_i^*)\, f_2(x)\, f_3(T), \tag{58}$$

where $f_2(x)$ and $f_3(T)$ are factors that describe the rate of growth of the
precipitate. Since $\Delta G_i^* = f(x, T)$ and at $T = T'$, $\Delta G_i^* = 0$, it follows
from (58) that $d\tau/dT$ varies with $x$ for $T > T'$ and is independent of $x$
for $T < T'$. This agreement with experience constitutes the main
evidence for the KONOBEYEVSKI-BORELIUS theory. There are, however,
certain facts that weaken the significance of the agreement. BORELIUS'
development is intended to describe the rate of nucleation at some
specified matrix composition. However, in testing his theory BORELIUS
assumed that $1/\tau$ is proportional to the nucleation frequency at zero
time. Since $\tau$ must be a resultant of the rates of nucleation and growth
of precipitate, neither of which was measured independently in the
experiments cited, the validity of this assumption is doubtful. It
should also be realized that in considering the precipitation from solid
solutions strain energy induced by the transformation must be taken
into account.

There are some other important characteristics of spinodal solutions
that might cause singularities in their behaviour. Most noteworthy is
that the driving free energy, $\Delta G_v$, for the nucleation and growth of the
precipitate of equilibrium composition, is a maximum (see Fig. 13) at
the spinodal composition. Therefore, it may be possible to explain a
singularity in the kinetic behaviour at the spinodal temperature (for a
given $x$) in terms of rates of precipitate growth or nucleation of clusters
of composition $x_s$ without recourse to the KONOBEYEVSKI-BORELIUS
theory.


Comparison of Theory and Experience 


In order to compare the theory with observation several conditions
must be fulfilled. It must be assured that the nucleation of the new
phases occurs homogeneously, and that the strain energy induced by the
transformation is negligible. The frequency of nucleation must be
measured directly. Even more importantly for a quantitative comparison,
the dependence of the free energy, $\Delta G$, and the interfacial free energy,
$\sigma$, on the composition must be known. The former dependence can be
measured by determining the energetics of the phases when both stable
and metastable, or by studying the energetics of the stable phases and
obtaining the difference of free energy by extrapolation. In principle,
the dependence of the interfacial energy upon composition can be
measured, but unfortunately, experimental difficulties have apparently
militated against its successful determination in any case.

While no direct measurements of the frequency of nucleation have
been made for the formation of a metallic solid solution from either a
liquid or a solid solution, the temperature at which the nucleation rate
becomes sensible has been determined for solidification in several binary
systems. As indicated previously, since the rate of nucleation from
fluid liquids is so temperature dependent, a nearly critical temperature
of nucleation of the solid will be observed. In measuring the critical
temperature for two series of liquid solutions the small particle
technique already described was applied. Results were obtained in the
case of the Cu-Ni system, in which there is complete miscibility in both
the liquid and the solid states, and for the Pb-Sn system, in which
there is miscibility in the liquid but immiscibility in the solid state.
The latter is a simple eutectic system. The variation with composition of
the temperature at which the nucleation frequency became sensible
is given for both these systems in Figs. 14 and 15. In the case of the
Cu-Ni system the critical temperature of nucleation nearly parallels
the liquidus temperatures and is continuous from the copper to the
nickel side of the phase diagram. The maximum supercooling is of the
order of 20 per cent of the absolute melting temperature and appears to
occur by essentially the same process in the alloys as in the pure
elements copper and nickel. In the case of the Pb-Sn system the
critical temperature of nucleation increases with the first small addition
of tin to the lead as well as with the first small addition of lead to tin.
Thus, for all compositions the magnitude of the supercooling is less than
for the pure elements. These results indicate that the interfacial energy
between the pure solid and liquid is decreased slightly by the first
addition of both tin and lead; otherwise the results are very similar to
those obtained with pure metals. However, since, as already indicated,
the dependence of the interfacial free energy on composition is not known
for either of these systems, a quantitative comparison of these
experiments with theory is impossible. Neither can the interfacial free energy
be obtained from the nucleation results as was the case for the
solidification of pure metals. However, the results are compatible with the
general nucleation theory.

Fig. 14. Solidification temperatures of Cu-Ni alloys as a function of
composition. After CECH and TURNBULL

In considering precipitation from solid solution the strain energy
induced by the transformation generally must be taken into account, as
indicated in the next section. Although there are no direct measurements
of the frequency of nucleation during precipitation, many
measurements have been made of the overall rate of the process, $R$.
This overall rate is a composite of the rates of nucleation and growth of
the precipitate. From $R$, it is possible, with the aid of various
assumptions, to deduce something about the rate of nucleation. We have
already noted that BORELIUS assumed that the value of $R$ when
precipitation is half-completed is proportional to $I$ when precipitation
has just begun. With the same assumption HARDY showed that the
data of BORELIUS et al. on precipitation of tin from lead rich solutions
can be described qualitatively by the BECKER theory of nucleation.
BECKER made similar assumptions when comparing the results of
JOHANSSON and HAGSTEN on precipitation from an Au-Pt alloy with
the predictions of his theory.

Fig. 15. Initial (×) and final (△) solidification temperatures of lead rich
lead-tin alloy droplets as a function of their composition. Also
solidification temperatures (○) of tin rich lead-tin alloys as a function of
composition. After HOLLOMON and TURNBULL

A procedure far more satisfactory than using a single datum on the $R$
vs $t$ curve is to determine the functional relation between $R$ and $t$
[$R = f(t)$] and find what assumptions about $I$ are compatible with it.
An example of this procedure is the analysis by WERT and ZENER of
WERT's data on $R = f(t)$ at various temperatures for the precipitation
of iron carbide from $\alpha$-iron. Making reasonable assumptions about
the growth of the precipitate and taking impingement into account
they were able to describe the $R = f(t)$ relations on the assumption
that all nuclei were effectively in existence at zero time, indicating
that nucleation was heterogeneous. There are in most solids
non-equilibrated structural defects. These defects as well as accidental
inclusions might serve as preferred nucleation sites.


STRUCTURE AND STRAIN 
Incoherent Nucleation 


A change in the specific volume of the system accompanies most phase 
transformations. When transformation occurs within extensive gas or 
liquid phases of high fluidity the volume changes are accommodated by 
flow of the extensive phase. However, crystals and glasses can be 
subjected to a considerable stress without flowing at appreciable rates. 
It follows, therefore, that the volume change accompanying the forma- 
tion of an element of a new phase in a crystalline or glassy medium may 
have to be accommodated by strains set up in the medium and possibly 
the new phase. Therefore, it is necessary to consider the strain energy 
as part of the free energy of the formation of an embryo. This
transformation strain energy will be proportional to the number of atoms
contained in the embryo and equation (2) is modified to:

$$\Delta G = a\sigma i^{2/3} + bi(\Delta G_v + E), \tag{59}$$

where $E$ is the increase in strain energy of the system accompanying the
formation of a unit volume of an embryo. Evidently nucleation will not
occur unless $\Delta G_v + E < 0$.
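The effect of the strain term on the nucleation barrier can be made concrete with a short calculation. This is a sketch only, assuming the form $\Delta G = a\sigma i^{2/3} + bi(\Delta G_v + E)$ of equation (59); the critical size and barrier are obtained here by setting $d(\Delta G)/di = 0$, a step not written out in the text, and the numerical inputs are hypothetical.

```python
def embryo_free_energy(i, a, sigma, b, dG_v, E):
    """Free energy of an embryo of i atoms, following equation (59)."""
    return a * sigma * i ** (2.0 / 3.0) + b * i * (dG_v + E)

def critical_embryo(a, sigma, b, dG_v, E):
    """Return (i_star, dG_star), or None when dG_v + E >= 0 (no nucleation).

    Setting d(dG)/di = 0 in equation (59) gives
        i_star**(1/3) = -2*a*sigma / (3*b*(dG_v + E)),
        dG_star       = 4*(a*sigma)**3 / (27*(b*(dG_v + E))**2).
    """
    drive = dG_v + E
    if drive >= 0:
        return None
    i_star = (-2.0 * a * sigma / (3.0 * b * drive)) ** 3
    dG_star = 4.0 * (a * sigma) ** 3 / (27.0 * (b * drive) ** 2)
    return i_star, dG_star

# Hypothetical values in arbitrary consistent units.
a, sigma, b, dG_v = 3.0, 1.0, 1.0, -2.0
unstrained = critical_embryo(a, sigma, b, dG_v, E=0.0)
strained = critical_embryo(a, sigma, b, dG_v, E=0.5)
blocked = critical_embryo(a, sigma, b, dG_v, E=2.5)   # dG_v + E > 0
```

With these inputs the strain energy raises the barrier by the factor $[\Delta G_v/(\Delta G_v + E)]^2$, and the last case returns `None` because the net driving force has become positive.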

A crystal usually begins to flow when a macrostress corresponding to
an elastic strain energy $E \approx 0.01$ cal/cc is applied. However, according
to the theory of plastic deformation perfect crystals should flow only if
$E \gtrsim 100$ cal/cc. Real crystals flow at very low applied stresses only
because they contain non-equilibrated imperfections. These imperfections
(FRANK-READ sources) generate dislocations that lead to plastic
flow upon the application of small stresses. It is believed that the
concentration of these imperfections in a well-annealed crystal is so small
(~ 10" /cc) that there is a negligible probability that a nucleus, which
has a volume of the order of 10-* cc, formed at random will contain one.
Therefore, in the vast majority of sites available for its formation
transformation stress energies of the order of 100 cal/cc can be supported.
However, the ability of elements of real crystals to support such strains
varies from element to element and it is expected that when the
transformation stresses are large nuclei will form preferentially in elements of
the crystal that do not, because of imperfections in their vicinity,
support large stresses. In general, Frank-Read sources will not operate
unless the stress is applied throughout an element of volume much larger
than $10^{-20}$ cc. These sources will, therefore, play an unimportant role in
relieving transformation stresses in nuclei formed in their vicinity.
However, stresses in very small volume elements in the vicinity of grain
boundaries and free surfaces probably can be partially relieved by flow.*
For these reasons we expect grain boundaries and free surfaces to be
preferred sites for nucleation in crystalline media.†

Since the dislocations in a real crystal are not in thermal equilibrium,
$\Delta G_v = \Delta G_v'$ for volume elements transforming in the vicinity of
dislocations is not necessarily identical with $\Delta G_v$ for the average volume
element. We expect that there will be some dislocation arrays for which
$\Delta G_v > \Delta G_v'$. Further, the magnitude of the elastic coefficients in the
neighbourhood of dislocations may be markedly different from the
mean elastic coefficients of the crystal. For these reasons some arrays of
dislocations in the body of the crystal will be preferential sites for
nucleation.

When a nucleus forms on a grain boundary some of the surface work
required for its formation is regained by destruction of the grain
boundary, which has a free energy per area equal to $\sigma_g$. The free energy
for forming an embryo on a grain boundary is:

$$\Delta G = (a\sigma - g\sigma_g)i^{2/3} + (\Delta G_v + E)bi, \tag{60}$$

where it is assumed that $\sigma$ is not dependent on crystal orientation and $g$
is determined by the shape of the boundary region destroyed. For
example, if the embryo is spherical in form, equation (60) may be
rewritten:

$$\Delta G = (4\sigma - \sigma_g)\pi r^2 + (\Delta G_v + E)\tfrac{4}{3}\pi r^3. \tag{61}$$
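A brief numerical sketch shows how much the destroyed grain boundary lowers the barrier for the spherical embryo of equation (61). The critical radius and barrier below follow from setting $d(\Delta G)/dr = 0$ in (61), which is a step added here rather than quoted from the text; the function name and input values are hypothetical.

```python
import math

def spherical_barrier(sigma, sigma_g, dG_v, E):
    """Critical radius and barrier for the spherical embryo of equation (61).

    With s = 4*sigma - sigma_g and d = dG_v + E < 0, d(dG)/dr = 0 gives
        r_star  = -s / (2*d),
        dG_star = pi * s**3 / (12 * d**2).
    """
    s = 4.0 * sigma - sigma_g
    d = dG_v + E
    if s <= 0 or d >= 0:
        raise ValueError("need 4*sigma > sigma_g and dG_v + E < 0")
    return -s / (2.0 * d), math.pi * s ** 3 / (12.0 * d ** 2)

# Hypothetical values in arbitrary consistent units.
sigma, sigma_g, dG_v, E = 1.0, 1.0, -2.0, 0.0
r_gb, barrier_gb = spherical_barrier(sigma, sigma_g, dG_v, E)     # on a boundary
r_hom, barrier_hom = spherical_barrier(sigma, 0.0, dG_v, E)       # no boundary
reduction = barrier_gb / barrier_hom
```

For a given driving force the ratio of the two barriers is $(1 - \sigma_g/4\sigma)^3$, so even a moderate grain boundary energy lowers the barrier substantially.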


We have noted that in liquid and gas media nucleation may be
heterogeneous because nuclei form preferentially on the surface of
accidental inclusions. While accidental inclusions are probably important
catalysts for nucleation in crystalline media, the factors summarized
above probably also contribute to an important degree to the
heterogeneity of nucleation in such media. On the basis of these
considerations it is expected that free surfaces, grain boundaries, inclusions,
and elements in the body of the crystals having a high stress energy will
serve as preferred sites for the nucleation of phases in crystals. For
small $|\Delta G_v|$ it is probable that nucleation in crystals is entirely
heterogeneous.

However, we shall develop the theory for the nucleation of a crystal- 
line phase in a perfect crystal. The main features of this development can 
be adapted easily to a formal description of heterogeneous nucleation. 

NABARRO has developed the theory for the relation between strain
energy due to precipitate particles and their shape and coherency. His
treatment will be adapted to the development of the formal theory of
nucleation for phase transformations involving no composition change.

* We are indebted to E. W. Hart for advice on these points.

† There are additional reasons, to be discussed, why grain boundary sites should be
preferred in nucleation.

Consider the phase transformation $\alpha \to \beta$, where $\alpha$ and $\beta$ are
crystalline phases. Let $a_0$ be the lattice spacing of some low index plane, $P_\beta$,
in the $\beta$ lattice and $a_0 + \Delta a$ the lattice spacing in a similar low index
plane of the $\alpha$ lattice; then we define the disregistry, $\delta$:

$$\delta = \Delta a/a_0. \tag{62}$$

In general the $\beta$ nucleus may be strained an amount:

$$\varepsilon = (x - a_0)/a_0, \tag{63}$$

where $x$ is the lattice parameter of the $P_\beta$ plane of the nucleus. When
$\varepsilon = \delta$ the nucleus is said to form coherently.

Suppose that the $\beta$ embryo is completely incoherent with $\alpha$ (i.e.
$\varepsilon = 0$). The accompanying volume change is accommodated by strain
in the embryo and in the $\alpha$ matrix surrounding it, the strain distribution
depending on the elastic coefficients of the two structures. However,
following NABARRO, the strain energy of the system is a minimum when
incoherent $\beta$ has the form of a thin disc and all the strain is in the
surrounding $\alpha$.

Consider a hole in the $\alpha$ having the form of an ellipsoid with
semi-axes $a$, $a$ and $b$. NABARRO calculated the strain energy per volume, $E$,
required to expand the hole for the limiting forms thin disc, sphere and
needle and proposed the following general relation between strain energy
and shape:

$$E = K|\Delta V|^2 f(b/a), \tag{64}$$

where $\Delta V$ is the difference in specific volume of the non-stressed $\alpha$ and
$\beta$ structures, $K$ is proportional to the rigidity modulus of $\alpha$, $f(b/a) = 0$
for $b/a = 0$, and is a maximum for a sphere ($b/a = 1$).

Although the strain energy of an incoherent embryo is zero when it
has the form of an infinitely thin disc it will probably have the form of
an oblate spheroid so that the $\alpha$-$\beta$ interfacial energy will not be too
large. The free energy for forming the embryo is:

$$\Delta G = \tfrac{4}{3}\pi a^3(1 - e^2)^{1/2}\{\Delta G_v + K|\Delta V|^2 f[(1 - e^2)^{1/2}]\} + \pi a^2\sigma\left[2 + \frac{1 - e^2}{e}\ln\left(\frac{1 + e}{1 - e}\right)\right], \tag{65}$$

where $e = (1 - b^2/a^2)^{1/2}$. The minimum free energy for forming a $\beta$
nucleus is:

$$\Delta G^* = f(a^*, e^*), \tag{66}$$

where $a^*$ and $e^*$ simultaneously satisfy the equations:

$$[\partial(\Delta G)/\partial a]_e = 0, \tag{67}$$

$$[\partial(\Delta G)/\partial e]_a = 0. \tag{68}$$


Coherent and Semi-coherent Nucleation


Although the increase in strain energy due to the formation of incoherent
$\beta$ in the form of a thin disc is very small the surface energy may be
quite large. Now suppose $\beta$ forms so that a plane $P_\beta$ and
crystallographic direction, $d$, is parallel to a plane and direction of similar atomic
packing in the $\alpha$. It has been postulated that the interfacial energy
between two phases is a minimum when they are in perfect registry
($\delta = \varepsilon$) across their boundary.

In general, $\delta > \varepsilon$ and the actual disregistry is equal to $\delta - \varepsilon$. For
$\delta - \varepsilon$ not too large (e.g. $\delta - \varepsilon < 0.20$) the $\alpha$-$\beta$ interface across plane $P_\beta$
can be pictured as consisting of small regions of relatively good fit
separated by an array of intersecting line dislocations.† The density
of dislocations in this array is $\delta - \varepsilon$. FRANK and VAN DER MERWE
demonstrated that in a misfitting monolayer the energy due to
dislocations is directly proportional to their density, $\rho$. It has been
assumed that $\rho \propto \delta - \varepsilon$ for the formation of a $\beta$ nucleus on $\alpha$, so that
$\sigma$ between $\alpha$ and $\beta$ may be written

$$\sigma = \gamma + \eta(\delta - \varepsilon), \tag{69}$$

where $\eta$ is a constant and $\gamma$ is a term due to bond type and chemistry.
It is apparent that $\sigma$ is a minimum when $\delta = \varepsilon$.

In the case of semi-coherent precipitation the problem of minimizing
the elastic energy of the system is somewhat more complicated than in
the case of a completely incoherent particle. If the moduli of the
precipitating particle are in all directions less than the moduli of the
matrix material, then the precipitate will be plate-like so that the strain
will reside primarily in the precipitate and the elastic energy is minimized.
In this case the elastic energy can be evaluated in the way indicated by
NABARRO, who has shown that $E$ for a disc shaped coherent precipitate
is of the order of $(1/5)E$ for a spherical precipitate. Applying these
arguments to the nucleation of coherent $\beta$ in $\alpha$ we expect that $\beta$ embryos
will probably again have the form of oblate spheroids and the free
energy required to form them is:

$$\Delta G = \tfrac{4}{3}\pi a^3(1 - e^2)^{1/2}\{\Delta G_v + K'\varepsilon^2 f[(1 - e^2)^{1/2}]\} + \pi a^2[\gamma + \eta(\delta - \varepsilon)]\left[2 + \frac{1 - e^2}{e}\ln\left(\frac{1 + e}{1 - e}\right)\right], \tag{70}$$

where $K'$ is proportional to the elastic coefficient for the coherent
formation of $\beta$. The minimum free energy for forming the nucleus is:

$$\Delta G^* = f(a^*, e^*, \varepsilon^*), \tag{71}$$

where $a^*$, $e^*$ and $\varepsilon^*$ simultaneously satisfy the equations

$$[\partial(\Delta G)/\partial a]_{e,\varepsilon} = 0, \tag{72}$$

$$[\partial(\Delta G)/\partial e]_{a,\varepsilon} = 0, \tag{73}$$

$$[\partial(\Delta G)/\partial \varepsilon]_{a,e} = 0. \tag{74}$$

† VAN DER MERWE and BROOKS have developed the description of interphase
boundaries in terms of a dislocation model. BROOKS has discussed in general terms the
application of the theory to nucleation in crystalline media.


In the case of a precipitate having all its elastic moduli larger than 
that of the matrix material, the shape of the precipitate which will 
minimize the elastic energy is that of a sphere. If, on the other hand, 
the modulus in one direction of the precipitate is less than the moduli of 
the matrix, the shape having minimum elastic energy will be rod-like. 
However, for embryos and nuclei, the anisotropy of the interfacial 
energy between particle and matrix will probably be more important in 
determining the shape of the particle than will considerations of elastic 
energy. The exact solution for the free energy of an embryo and a 
nucleus having varying elastic moduli can only be established if the 
dependence of the elastic modulus of matrix and precipitate on orienta- 
tion as well as the anisotropy of the surface energy are known. 


Theory of Catalysis of Crystal Nucleation 

In a preceding section a formal theory of nucleation catalysis was
developed in which the catalytic potency was characterized by the
contact angle $\theta$. While this theory proved to be satisfactory for
describing the rate of solidification of mercury and tin droplets coated with
various surface films, it gives little information on the mechanism of
nucleation catalysis.

On the basis of the postulate made in the preceding section that the 
interphase interfacial energy is a minimum when the phases are coherent 
across the boundary, a theory of nucleation catalysis has been derived.” 
According to this theory the free energy of a crystallization nucleus 
formed on a catalyst surface is a minimum when the nucleus is at least 
partially strained in the direction of better atomic matching with the 
catalyst across the interface. 

We obtain the interfacial energy between the catalyst and the
nucleus, $\sigma_{SC}$, from equation (69). Now relating $\sigma_{SC}$ to the contact
angle through equation (15) we obtain:

$$1 - m = \beta + \eta(\delta - \varepsilon)/\sigma_{LC}, \tag{75}$$

where $m = \cos\theta$ and

$$\beta = 1 - (\sigma_{LS} - \gamma)/\sigma_{LC}. \tag{76}$$

The free energy of forming a nucleus having the shape of a spherical
sector of radius $r$ on a catalyst surface is

$$\Delta G_i = V(\Delta G_v + c\varepsilon^2) + 2\pi r^2(1 - m)\sigma_{LC} + \pi r^2(1 - m^2)(\sigma_{SC} - \sigma_{LS}), \tag{77}$$

where $c$ is the appropriate coefficient of elasticity and $V$ is the volume
of the sector. $\Delta G_i^*$ for a given strain is:

$$\Delta G_i^* = 4\pi\sigma_{LC}^3(2 + m)(1 - m)^2/3(\Delta G_v + c\varepsilon^2)^2, \tag{78}$$

and the value of $\varepsilon = \varepsilon^*$ that minimizes $\Delta G_i^*$ is:

$$\varepsilon^* = (\delta + \beta\sigma_{LC}/\eta) - [(\delta + \beta\sigma_{LC}/\eta)^2 - |\Delta G_v|/c]^{1/2}. \tag{79}$$

When the nucleation rate is perceptible, say 1 cm$^{-2}$ sec$^{-1}$, $\Delta G_i^* \approx 60kT$
and $|\Delta G_v| = |\Delta G_v'|$. For $c\delta^2 > |\Delta G_v'|$ and making the approximation
$2 + m = 3$ we obtain $|\Delta G_v'| = f(\delta)$ as follows:

$$|\Delta G_v'| = (4\pi\eta^2\sigma_{LC}/60kT)^{1/2}(\beta\sigma_{LC}/\eta + \delta). \tag{80}$$

If $|\Delta G_v'|$ is larger than $c\delta^2$ the nucleus forms coherently and we have:

$$|\Delta G_v'| = c\delta^2. \tag{81}$$

The approximate condition for coherent nucleation is found by equating
(80) and (81) to be:

$$\delta = [\eta u + (\eta^2u^2 + 4\beta u\sigma_{LC}c)^{1/2}]/2c = \delta_c, \tag{82}$$

where $u = (4\pi\sigma_{LC}/60kT)^{1/2}$.

$|\Delta G_v'|$ is proportional, equation (30), to the supercooling $\Delta T'$ at
which nucleation is perceptible. For $\delta < \delta_c$, $\Delta T'$ varies as $\delta^2$. When
$\delta > \delta_c$ and for a given value of $\beta$, $\Delta T'$ varies as $\delta$.

The potency, $p$, of a nucleation catalyst is proportional (see equation
(80)) to $1/(\beta\sigma_{LC}/\eta + \delta)$. Although $p$ and $\delta$ are reciprocally related the
order of $p$ is not necessarily the same as the order of $1/\delta$ because of $\beta$.
If it is assumed that†

$$\sigma_{LC} = \sigma_{LS} - \gamma, \tag{83}$$

$\beta = 0$ and a particularly simple theory of nucleation catalysis results
wherein the order of $p$ is identical with the order of $1/\delta$.

When equation (83) holds equations (79), (80) and (82) reduce to,
respectively:

$$\varepsilon^* = \delta - (\delta^2 - |\Delta G_v|/c)^{1/2}, \tag{84}$$

$$|\Delta G_v'| = (4\pi\eta^2\sigma_{LC}/60kT)^{1/2}\delta, \tag{85}$$

and

$$\delta_c = (4\pi\sigma_{LC}/60kT)^{1/2}(\eta/c). \tag{86}$$
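The two regimes of the crystallographic theory can be illustrated with a short sketch. It assumes the relations as read here: $u = (4\pi\sigma_{LC}/60kT)^{1/2}$, the incoherent expression (80) written as $u(\beta\sigma_{LC} + \eta\delta)$, the coherent expression (81) as $c\delta^2$, and $\delta_c$ taken as the positive root obtained by equating the two. The function names, the capping of $\varepsilon^*$ at $\delta$, and all numerical inputs are assumptions made for illustration only.

```python
import math

def u_factor(sigma_lc, kT):
    """u = (4*pi*sigma_LC / 60kT)**(1/2), as defined after equation (82)."""
    return math.sqrt(4.0 * math.pi * sigma_lc / (60.0 * kT))

def epsilon_star(delta, beta, sigma_lc, eta, c, dG_v):
    """Strain minimizing the barrier, equation (79).

    Assumption: the strain never exceeds the disregistry delta, so the
    result is capped at delta (fully coherent nucleus).
    """
    A = delta + beta * sigma_lc / eta
    disc = A * A - abs(dG_v) / c
    if disc < 0.0:
        return delta
    return min(delta, A - math.sqrt(disc))

def delta_c(beta, sigma_lc, eta, c, kT):
    """Disregistry at which expressions (80) and (81) coincide."""
    u = u_factor(sigma_lc, kT)
    root = math.sqrt((eta * u) ** 2 + 4.0 * beta * u * sigma_lc * c)
    return (eta * u + root) / (2.0 * c)

def driving_force_at_onset(delta, beta, sigma_lc, eta, c, kT):
    """|dG_v'| at perceptible nucleation: (81) below delta_c, (80) above."""
    if delta <= delta_c(beta, sigma_lc, eta, c, kT):
        return c * delta ** 2
    return u_factor(sigma_lc, kT) * (beta * sigma_lc + eta * delta)

# Hypothetical parameters in arbitrary consistent units.
sigma_lc, eta, c, kT = 1.0, 2.0, 50.0, 0.01
dc0 = delta_c(0.0, sigma_lc, eta, c, kT)     # beta = 0, the case of (86)
dc1 = delta_c(0.05, sigma_lc, eta, c, kT)
```

Below $\delta_c$ the required driving force, and hence the supercooling, grows as $\delta^2$; above it the growth is linear in $\delta$, which is the change of regime described in the text.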


There are a few data sufficiently quantitative to test the predictions
of the crystallographic theory of nucleation catalysis. When it is
assumed that ice nuclei form coherently on silver iodide ($\delta = 0.0145$) a
value of $\Delta T' \approx 3.1°$ is calculated that is in good agreement with the
value $\Delta T' \approx 2.5°$ measured by VONNEGUT. Experience indicates that
nuclei probably do not form coherently on catalyst surfaces for
$\delta > 0.005$ to $0.015$. This result is in rough agreement with the
predictions of FRANK and VAN DER MERWE's theory.

† The plausibility of equation (83) has been discussed elsewhere.

It was pointed out that most of the results on the kinetics of
heterogeneous nucleation of mercury and tin crystals can be derived on
the assumption that the nucleus is derived from a macroscopic structure
having a melting point little, if any, different from the melting point of
the stable structure. These results indicate that the nuclei formed
incoherently and with negligible strain. Because of the large
supercooling and the marked dissimilarity in structure between the droplet
surface films and the forming metal crystals it is not surprising that the
nuclei formed with $\varepsilon \approx 0$ in these experiments.
We have been tacitly assuming that transformation nuclei are
always characterized by the minimum possible free energy of formation.
Actually the net rate of transformation, $dx/dt$, of $\alpha$ to $\beta$ must be
some function of the product of the rate of nucleation $I$ and the rate of
increase in volume of the nucleated region. The rate of increase in volume
of the nucleated region is in general proportional to $R^3$, where $R$ is the
linear growth rate. Therefore

$$dx/dt = f(IR^3).$$

Now the transformation from $\alpha$ to $\beta$ will occur by a mode of nucleation
and growth that will make $dx/dt$ a maximum. Let the value of $I$
corresponding to the minimum $\Delta G^*$ be $I'$. The structure of the nucleus
corresponding to $I'$ is not necessarily identical with the structure that
leads to a maximum growth rate, $R_{max}$. We shall find that the martensitic
type of transformation is brought about by nuclei (S-nuclei) that
require a much higher free energy for their formation than would
other possible configurations (N-nuclei). Nevertheless, transformation
is effected by S-type rather than N-type nuclei because the rate of
growth of S regions is many orders of magnitude greater than that of
N regions.

In the coherent formation of $\beta$ in $\alpha$ the transformation strain energy
is a minimum when $\beta$ has the form of a thin disc
and is strained in tension or compression to be coherent (normal or
N-type coherency) with $\alpha$ across a particular crystallographic plane.
A different type of coherency results when the $\alpha$ matrix immediately
surrounding the $\beta$ is strained by shear to be coherent with $\beta$ (shear or
S-type coherency). Fig. 16 shows schematically a $\beta$ region surrounded
by a region of $\alpha$ sheared to maintain coherency across the interface.
$E$ for a $\beta$-S embryo is larger than $E$ for a $\beta$-N embryo. However, the
atomic displacements required for the growth of S-nuclei are in most
transformations much less than those required for the growth of
N-nuclei. Therefore, S grains once formed grow at rates many orders
of magnitude faster than N grains, so that even though the N-nuclei
outnumber S-nuclei most of the transformation may be effected by
S-nuclei.

FISHER, HOLLOMON and TURNBULL have developed a theory for the
energetics of formation of S-nuclei. They assumed that the embryos
form in the shape of lenticular discs of radius $r$ and thickness $t$. The
volume of this disc is $\pi r^2t/2$ and its area is $2\pi r^2$. It is assumed that the
strain energy is localized in a sphere of radius $r$ surrounding the embryo.
The strain at any point in this sphere is $t/r$ when $t \ll r$. Hence,

$$E = \tfrac{1}{2}c(t/r)^2,$$

where $c$ is proportional to the shear modulus of $\alpha$. The free energy of
formation of an S-embryo is then:

$$\Delta G = 2\pi r^2\sigma + \pi r^2t\,\Delta G_v/2 + c\pi rt^2$$

and

$$\Delta G^* = f(r^*, t^*),$$

where $r^*$ and $t^*$ simultaneously satisfy the equations

$$[\partial(\Delta G)/\partial r]_t = 0,$$

$$[\partial(\Delta G)/\partial t]_r = 0.$$

Carrying out the indicated operations we find:

$$\Delta G^* = 8192\pi c^2\sigma^3/27(\Delta G_v)^4, \tag{93}$$

$$t^* = -16\sigma/3\Delta G_v, \tag{94}$$

$$t/r = t^*/r^* = -\Delta G_v/4c. \tag{95}$$

S-embryos grow and shrink rapidly so that down to relatively low
temperatures $\alpha$ will contain an equilibrium distribution of $\beta$-S embryos
characteristic of the given temperature. Therefore, this theory predicts
that martensite transformation nuclei only form isothermally.
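The results (93) to (95) can be verified by a short calculation. The sketch below assumes the S-embryo free energy in the form $2\pi r^2\sigma + \pi r^2t\Delta G_v/2 + c\pi rt^2$; the critical radius $r^* = 64c\sigma/3(\Delta G_v)^2$ is not quoted in the text but follows from dividing (94) by (95). Function names and input values are hypothetical.

```python
import math

def s_embryo_free_energy(r, t, sigma, dG_v, c):
    """Free energy of a lenticular S-embryo of radius r and thickness t."""
    surface = 2.0 * math.pi * r ** 2 * sigma
    volume = math.pi * r ** 2 * t * dG_v / 2.0     # dG_v < 0
    strain = c * math.pi * r * t ** 2              # shear strain energy
    return surface + volume + strain

def critical_s_nucleus(sigma, dG_v, c):
    """Return (r_star, t_star, dG_star) from equations (93) to (95)."""
    t_star = -16.0 * sigma / (3.0 * dG_v)                       # (94)
    r_star = t_star / (-dG_v / (4.0 * c))                       # from (95)
    dG_star = 8192.0 * math.pi * c ** 2 * sigma ** 3 / (27.0 * dG_v ** 4)  # (93)
    return r_star, t_star, dG_star

# Hypothetical values in arbitrary consistent units.
sigma, dG_v, c = 1.0, -2.0, 3.0
r_star, t_star, dG_star = critical_s_nucleus(sigma, dG_v, c)
direct = s_embryo_free_energy(r_star, t_star, sigma, dG_v, c)

# Central differences confirm that both partial derivatives vanish.
h = 1e-6
slope_r = (s_embryo_free_energy(r_star + h, t_star, sigma, dG_v, c)
           - s_embryo_free_energy(r_star - h, t_star, sigma, dG_v, c)) / (2 * h)
slope_t = (s_embryo_free_energy(r_star, t_star + h, sigma, dG_v, c)
           - s_embryo_free_energy(r_star, t_star - h, sigma, dG_v, c)) / (2 * h)
```

The barrier varies as $c^2\sigma^3/(\Delta G_v)^4$, so it is far more sensitive to the driving force than the inverse-square dependence found for unstrained nuclei.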

We have pointed out that nucleation in crystals may be heterogeneous
to a marked degree because of non-equilibrated imperfections. That is,
$\Delta G^*$ may vary from element to element in the $\alpha$ to give a spectrum of
preferred nucleation sites characterized by different values of $\Delta G^*$ and
volume, $v$, of the element. Provided $v$ is larger than the volume, $v^*$, of
the critical nucleus for growth of $\beta$ into $\alpha$ of mean free energy, the
nucleation frequency may be written

$$I = (A/V)\sum_j V_j \exp(-\Delta G_j^*/kT), \tag{96}$$

where $V$ is the volume of the specimen and $V_j$ is the total untransformed
volume of elements characterized by $\Delta G_j^*$. Let $I_j$ be the nucleation
frequency per volume corresponding to a particular $\Delta G_j^*$ and for simplicity
assume that all the preferred sites have the same volume $v$ and yield
only one nucleus. It then follows that there will be a dependence of
$I$ on time at constant temperature given by

$$I = (1/V)\sum_j I_jV_j^0 \exp(-I_jvt), \tag{97}$$

where $V_j = V_j^0$ at $t = 0$. This generalization predicts that isothermal
martensite nuclei will form at preferred sites and that the nucleation
frequency will decrease with increasing time.
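The decay of the nucleation frequency as preferred sites are used up can be sketched as follows. This assumes the reading of (96) and (97) in which each class of sites $j$ has frequency per volume $I_j = A\exp(-\Delta G_j^*/kT)$ and an initial untransformed volume $V_j^0$; the site spectrum and all parameter values below are hypothetical.

```python
import math

def nucleation_frequency(t, sites, A, kT, v, V):
    """Overall nucleation frequency at time t, following equation (97).

    sites is a list of (dG_star_j, V0_j) pairs: the barrier of each class
    of preferred site and the volume of such sites present at t = 0.
    """
    total = 0.0
    for dG_star, V0 in sites:
        I_j = A * math.exp(-dG_star / kT)        # frequency per volume
        total += I_j * V0 * math.exp(-I_j * v * t)  # sites consumed over time
    return total / V

# Hypothetical spectrum: a few easy sites and more numerous difficult ones.
sites = [(0.30, 1.0), (0.50, 2.0)]
A, kT, v, V = 1.0e3, 0.05, 1.0e-3, 10.0
times = [0.0, 100.0, 1000.0, 10000.0]
frequencies = [nucleation_frequency(t, sites, A, kT, v, V) for t in times]
```

The sites with the lowest barrier dominate at first and are exhausted first, so the computed frequency falls monotonically with time at constant temperature.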






Experience indicates that many martensite transformations are
athermal; that is, the amount of transformation appears to be independent
of time at temperature and is a function of temperature only. It is
as though transformation nuclei were in existence at all times and that a
certain number, $n_a$, of these become active for growth when the
temperature is lowered by a unit amount, $n_a$ increasing with decreasing
temperature.

This athermal characteristic of some martensite reactions is explained
on the basis of either frozen-in composition fluctuations or by frozen-in
variations in free energy from point to point in the $\alpha$ crystal due to local
strains. Formally, the theory is very much the same from either point
of view. We shall first develop the theory for the formation of athermal
nuclei from composition fluctuations.






When $\alpha$ contains a major component A and a minor one B, where B
has a strong stabilizing effect on $\alpha$ relative to $\beta$, we may expect that $\beta$
embryos will form preferentially in regions that, because of composition
fluctuations, are depleted in B. Let $\Delta g_i$ be the increase in free energy
associated with removing all $i$ atoms of B from a region of $\alpha$ of average
composition. According to the development of FISHER, HOLLOMON and
TURNBULL the number of these regions per volume, $n_i$, of pure A is:

$$n_i = n \exp(-\Delta g_i/kT), \tag{98}$$

where $n$ is the number of lattice sites per volume. The number of these
regions $n_i^*$ exceeding a critical size corresponding to $i \geq i^*$ is

$$n_i^* = \int_{i^*}^{\infty} n_i\,di = (n/i^*)(\Delta g_i^*/kT)\exp(-\Delta g_i^*/kT), \tag{99}$$

where $\Delta g_i = \Delta g_i^*$ at $i = i^*$.

Now suppose that at a given temperature the $\beta$-S nucleus would contain
$i^*$ atoms of B if it had the same composition as the $\alpha$ matrix. $\Delta G^*$
may be large enough so that the equilibrium concentration of S-nuclei,
$n^* = n \exp[-\Delta G^*/kT]$, is negligible. However, the number of $\alpha$
regions, $n_i^*$, free of B and having dimensions equal to or larger than $t^*$
and $r^*$ can be appreciable. Since the volume free energy loss for formation
of $\beta$ in one of these B-poor regions (B-) is much greater than for its
formation in a region of uniform composition, the probability that $\beta$ will
nucleate in a B- region is many orders of magnitude larger than the
probability that it will nucleate in a region of uniform composition, B.

When a $\beta$-S nucleus forms in a B- element it will immediately grow
to the boundaries of the element. Then if the dimensions of the B- element
equal or exceed $t^*$ and $r^*$ the nucleus will continue to grow into the B
region, providing the interfacial energy between B and B- is negligible
when the structure is the same on either side of the boundary. Therefore,
in an alloy of the type we are considering it is possible that $\beta$ is
nucleated entirely by B- elements formed by fluctuations.

Fig. 17. Distribution of embryos, depleted in a minor solution
constituent, with respect to size

At relatively low temperatures the composition fluctuations may be
frozen in and the number and size of B- regions will be the equilibrium
number characteristic of a high temperature. We shall suppose that the
B- elements are entirely transformed to β. A schematic representation
of the number of these B- regions as a function of i (proportional to
volume) is given in Fig. 17. At any temperature there is a critical
volume, corresponding to i*, of B- that will transform the surrounding
α of B composition to β. When a sample is quenched to any temperature
T the number of these supercritical regions, hence β nuclei, is given by
(99). A number, n_i*, will grow immediately and the transformation
will stop. When the temperature is further lowered i* is decreased and
more β nuclei are formed. Since these nuclei are brought into being not
by thermal composition fluctuations at temperature but by having the
potential barrier to nucleation lowered they are called athermal nuclei.
The total number that have formed upon quenching to temperature T_1
is proportional to the area a c i_1* in Fig. 17. Upon quenching to T_2
an additional number per untransformed volume, proportional to the
area a i_1* i_2* b in Fig. 17, forms.
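
The area argument for successive quenches can be sketched as follows. This is an illustration under stated assumptions rather than the authors' calculation: the frozen-in size distribution is taken to be a simple exponential, n_i = n exp(−i/λ), with λ a hypothetical decay length fixed at the high temperature where the fluctuations were frozen in, and the critical sizes i_1* > i_2* > ... are supplied by hand rather than derived from the free energy data. The correction for volume already transformed is ignored.

```python
import math


def frozen_tail(n_sites, decay_length, i_star):
    """Regions per volume with i >= i_star for n_i = n * exp(-i / decay_length).

    ASSUMPTION: exponential frozen-in distribution (illustrative only).
    """
    return n_sites * decay_length * math.exp(-i_star / decay_length)


def athermal_increments(n_sites, decay_length, critical_sizes):
    """New athermal nuclei made available by each successive quench.

    critical_sizes must decrease (i_1* > i_2* > ...), as it does when the
    temperature is lowered step by step. Each increment is the area under
    the distribution between two neighbouring critical sizes.
    """
    increments = []
    previous_tail = 0.0
    for i_star in critical_sizes:
        tail = frozen_tail(n_sites, decay_length, i_star)
        increments.append(tail - previous_tail)
        previous_tail = tail
    return increments


if __name__ == "__main__":
    for step, extra in enumerate(athermal_increments(1.0e28, 0.5, [12, 10, 8]), 1):
        print(f"quench {step}: {extra:.3e} new nuclei per unit volume")
```

No nuclei appear during an isothermal hold in this picture; each increment appears only when the temperature, and with it i*, is lowered.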

When ΔG_v is sufficiently large so that the equilibrium concentration
of β nuclei in B regions is appreciable, nucleation takes place isothermally
and at a steady rate. For alloys the possibility exists that β is
nucleated by B- regions at temperatures too high for the formation of β
nuclei in B regions. Nuclei formed in this way come into being only
when the temperature is lowered and the martensite formed therefrom
is athermal.

By considering the heterogeneous distribution of free energy expected
in a real crystal a qualitative theory for athermal nucleation can be
derived without invoking composition fluctuations. Suppose that (see
equation (96)) the volume of the preferred nucleation sites is of the same
order as v*, the volume that the β nuclei must exceed in order to grow
into the α matrix of mean free energy. Then at a preferred site having a
volume, v, β may nucleate, grow to the site boundary, and be unable to
grow further until the temperature is lowered sufficiently to make
v* < v. There will be some history-sensitive distribution of preferred
sites that will be described by a function n_v = f(v, ΔG_v), where n_v is the
number of sites characterized by volume v and free energy ΔG_v. The
number of athermal nuclei, n, formed at a given temperature can be
calculated from n_v = f(v, ΔG_v): n will be the number of high energy
regions having a volume exceeding πr*²t*/2, where r* and t* are given
by equations (94) and (95).

Cohen, Machlin and co-workers have proposed an alternative
theory for the nucleation of martensite that ignores any contribution of
interfacial energy to the free energy of nucleus formation. Let the angle
through which α must be sheared to form β be θ_(α-β). There will exist in
the α many small embryos that are only partially sheared to β, that is,
θ < θ_(α-β). These θ embryos must be characterized by their shear angle
θ and the number of atoms i that they contain.

Now according to the Cohen-Machlin theory, for a given value of i,
the free energy of the θ embryo goes through a maximum at θ = θ_m as
θ is increased. We may imagine a three dimensional surface
ΔG = f(i, θ) for constant temperature. The problem is to find the saddle
point in this surface corresponding to the minimum free energy that the
embryo must have before it can grow to a large size with the free energy
continually decreasing.†

† It will be noted that the Cohen-Machlin theory of nucleation with i and θ the
variables is the structure-strain analogue of the Borelius theory of nucleation of
precipitate with i and Δx the variables.

In the ΔG = f(i, θ) surface there is a single ridge (shear ridge), the
maximum height of which is given by the relation ΔG_m = F(i), where
ΔG = ΔG_m at θ = θ_m. The value of i at the saddle point i = i* should
be found by solving the equation d(ΔG_m)/di = 0. The major difficulty
of the Cohen-Machlin theory is that the functions ΔG = f(i, θ) and
ΔG_m = F(i) are only qualitatively defined. We expect that the increase
in free energy of the embryo due to shearing through a given
angle, e.g. θ = θ_m, should increase continuously with i and that the
critical fluctuation, α → β, will, therefore, cross the shear ridge at a
value i → 0. Therefore, the Cohen-Machlin theory seems to require,
as does the Borelius theory, an additional relation that will permit
the calculation of the size of the region that undergoes the critical
fluctuation.

Cohen, Machlin and co-workers cite the often-observed heterogeneous
nature of the martensite transformation as strong supporting evidence
for their theory. However, as we have seen, the surface barrier
theory also predicts a heterogeneous martensite reaction if ΔG varies
from one element of α volume to another due to non-equilibration.


Comparison of Theory and Experience 


Non-martensite Transformations—Although there have been extensive 
measurements of the isothermal kinetics of various solid-solid non- 
martensitic N-type transformations there is little quantitative know- 
ledge of the nucleation frequency for these transformations. It is known 
that the supercooling required to initiate N-type transformations in 
pure metals is usually very small; for example, of the order of 5-10° in 
the γ → α iron transition. Orientation relations for these transformations
are usually consistent with the description that the low index
crystallographic planes and directions in the two structures having the 
most similar atomic packing and registry are parallel. Nuclei usually 
form preferentially in the vicinity of grain boundaries and free surfaces. 
More quantitative, carefully controlled measurements of nucleation 
frequency in N-type transformations are necessary in order to critically
test the predictions of the nucleation theory. 


Martensite Transformations—The data on nucleation frequency for 
S-type transformations are no more quantitative than for N-type but 
more information of value for comparison with nucleation theories can 
be inferred from the overall S-type transformation characteristics. 

For many years it was believed that S transformations were insup- 
pressible by quenching and almost entirely athermal in nature. Recently, 
however, Kurdjumov and Maksimova showed that the S transformation
in certain iron alloys could be suppressed entirely by rapid quenching
to liquid nitrogen temperature, and that upon subsequent isothermal
holding at this temperature and higher an S-type transformation
occurred that was entirely isothermal in character. As much as 50 per
cent of the γ-iron phase could transform to α by the S-mechanism
isothermally. The essential requirement for the isothermal mode of
transformation seemed to be that the alloy be characterized by an M_s
temperature of the order of 300°K or less.

About the same time Averbach and Cohen showed that there
was a small isothermal component of the S transformation in iron-1
per cent carbon alloys. However, the amount of alloy transformed
isothermally was small relative to that transformed athermally. More
recently Das Gupta and Lement found that after some athermal S
transformation in a high chromium steel a substantial amount of
transformation took place isothermally. They found that considerable
athermal transformation had occurred before the isothermal
transformation began.

Kulin and Cohen were unable to confirm Kurdjumov and
Maksimova's result that athermal martensite formation in a 6 per cent
manganese, 0.6 per cent carbon steel could be completely suppressed by
rapid quenching. However, it is significant that Kurdjumov and
Maksimova (K-M) report an M_s for this steel about 100° lower than
that found by Kulin and Cohen and other investigators; it seems
likely, therefore, that K-M made an error in reporting the composition
of the steel.

Recent results of Cech and Hollomon on the isothermal S transformation
of a 23 per cent Ni, 4 per cent Mn, 0.05 per cent carbon steel
have completely confirmed K-M's main conclusions. The transformation
was entirely suppressed, within the experimental uncertainty,
upon rapid quenching to −196°. Upon holding at various temperatures
between −196° and −80° transformation occurred isothermally and as
much as 50 per cent of the specimen transformed by the isothermal
mode. The results were satisfactorily described by a conventional
time-temperature transformation (T.T.T.) diagram (Fig. 18) with a nose
temperature at −126°C. Particular isotherms were described fairly
well by straight line relations between the fraction transformed and the
logarithm of the time. Metallographic examination disclosed that the
rate determining step in the transformation is nucleation and not
growth. K-M and Das Gupta and Lement also found that nucleation
is the rate-determining step in the isothermal transformation.

Fisher established that Cech and Hollomon's (C-H) isotherms can
be described on the assumption of a constant nucleation frequency per
untransformed volume. The rapid decrease in transformation rate with
time is due to a partitioning effect whereby martensite plates already
formed cut down the volume that can be transformed by nuclei formed
at later stages of the transformation. Fisher also demonstrated that
the C-H T.T.T. curves are described remarkably well by the
Fisher-Hollomon-Turnbull (F-H-T) theory of thermal martensite formation
(equations (88) to (95)). Since the rate determining step is nucleation
the temperature T_max corresponding to the nose of the T.T.T. curve can
be calculated by substitution of equation (93) into (28). Upon performing
this operation using Jones and Pumphrey's free energy data
and assuming ΔG_A = 0, Fisher calculates T_max = 147°K, in excellent
agreement with the measured T_max = 147°K. Further, with a single
disposable parameter (σ³c²) the entire T.T.T. curve was described from


Fig. 18. T.T.T. curves for formation of martensite in Fe-Ni-Mn-C alloy
(temperature in degrees C against isothermal holding time in minutes).
After Cech and Hollomon


equations (24) and (93) within experimental error. K,, (see equation 
(24)) was estimated from the order of magnitude of J to be 10°* sec cm-*, 
which agrees very well with the value 10°’ sec? cm~-* estimated from 
equation (24). This agreement between theory and experience is 
certainly not trivial; it constitutes powerful support for the F-H-T
theory of nucleation without composition fluctuations in S transformations.
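
Why a nucleation-controlled T.T.T. curve shows a nose even when no activation energy for growth is included can be seen from a toy model. The sketch below is not Fisher's calculation and does not use equation (93) or the Jones and Pumphrey data, neither of which is reproduced in this section. It assumes, purely for illustration, a rate I = K exp(−ΔG*/kT) with ΔG* inversely proportional to the fourth power of the driving force, and a driving force proportional to the supercooling T_0 − T; the names `t_equilibrium`, `barrier_scale` and `exponent` are hypothetical.

```python
import math


def log_rate(temperature, t_equilibrium, barrier_scale, exponent=4):
    """ln(I/K) = -dG*/kT for the assumed barrier.

    ASSUMPTIONS: dG* = barrier_scale * k / (T0 - T)**exponent, so that the
    Boltzmann constant is folded into barrier_scale, and the driving force
    is linear in the supercooling.
    """
    undercooling = t_equilibrium - temperature
    return -barrier_scale / (undercooling ** exponent * temperature)


def nose_temperature(t_equilibrium, barrier_scale, exponent=4, steps=100_000):
    """Grid search for the temperature of maximum nucleation rate."""
    best_t, best_value = None, -math.inf
    for k in range(1, steps):
        t = t_equilibrium * k / steps
        value = log_rate(t, t_equilibrium, barrier_scale, exponent)
        if value > best_value:
            best_t, best_value = t, value
    return best_t


if __name__ == "__main__":
    print(nose_temperature(500.0, 1.0e12))
```

In this toy model the maximum lies at T_0/(exponent + 1) regardless of `barrier_scale`: the barrier falls with supercooling while kT also falls, and the two effects balance at the nose.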

F-H-T have favoured the view that athermal martensite can be
accounted for only by frozen-in composition fluctuations. For example,
in iron base alloys the elements most effective in retarding the martensite
transformation are those that decrease the free energy driving
transformation. When the composition permits, fluctuations occur that lead
to the formation of an appreciable concentration of regions free of
alloying element and nucleation of α in these regions is much more
probable than in regions of uniform composition. When an alloy is
quenched to a particular temperature the total number, N, of martensite
plates that have formed per volume, when the fraction transformed
is small, is

$$N = It + n_i^* \qquad (100)$$

where I is the nucleation frequency per unit volume for regions of
uniform composition {I = K_v exp(−ΔG*/kT), where ΔG* is given by
equation (93)} and n_i*, given by equation (99), is the number of plates
formed from nuclei produced by frozen-in composition fluctuations.
The first term of equation (100) is the thermal component and the
second term the athermal component of the nucleation. When the
alloying content is large (about 30 atom per cent) the probability
of finding a large region of pure parent metal is small, so that n_i* is
small and the thermal component, It, may predominate. However, if
the alloying content is quite small n_i* may become large before It
becomes important and most of the transformation will take place by the
athermal mode. A more quantitative theory of the relative magnitude
of the thermal and athermal components is desirable.

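
The balance between the two terms of equation (100) can be illustrated with a short sketch. The function names and numerical values below are hypothetical and chosen only to show the bookkeeping; the relation is applied to the early stage of transformation, before partitioning of the untransformed volume becomes important.

```python
def plates_per_volume(nucleation_rate, time, athermal_nuclei):
    """Equation (100): N = I*t + n_i*.

    Returns the total together with its thermal and athermal parts.
    """
    thermal = nucleation_rate * time
    return thermal + athermal_nuclei, thermal, athermal_nuclei


def crossover_time(nucleation_rate, athermal_nuclei):
    """Holding time at which the thermal term I*t equals the athermal term."""
    return athermal_nuclei / nucleation_rate


if __name__ == "__main__":
    # Illustrative values only: I in plates per cm^3 per sec, n_i* per cm^3.
    total, thermal, athermal = plates_per_volume(1.0e3, 60.0, 1.0e6)
    print(total, thermal, athermal)
    print(crossover_time(1.0e3, 1.0e6))
```

When the crossover time is long compared with the time of observation the transformation appears athermal; when it is short the isothermal mode predominates.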
F-H-T have demonstrated that the variation in M_s temperature with
composition for the formation of athermal martensite in iron alloys
containing carbon and chromium is described by the composition
fluctuation theory with the use of only one disposable parameter, which
is the interfacial energy per area. Also the data of Harris and
[…] of athermal martensite formed as a function of
[…] carbon-chromium steel were described within experimental
[…] (see equation (93)) the only disposable parameter.
It was shown that the values of σ (24 ergs per cm²) and c (0.46 × 10[…]
dynes per cm²) chosen to give agreement between theory and experiment
are plausible.

It has not yet been established whether or not composition fluctuations
are a necessary condition for the athermal mode of martensite
formation. Athermal martensite is known to form in the low temperature
face-centred cubic–body-centred cubic transition in lithium
and in cobalt (face-centred cubic–hexagonal close packed).
However, these substances were not of highest purity and no
study on the effect of alloying elements on the
martensite transformation in these metals seems to have
been made. In order to find whether or not composition fluctuations
are a necessary concomitant of athermal martensite in these metals it
is necessary to make such a study on the effect of alloying elements.

The amount of martensite that forms, either thermally or athermally,
may be materially decreased by decreasing […] the specimen is
initially cooled to T_1. This phenomenon is called stabilization. For
example, Cech and Hollomon found that the rate of isothermal
transformation in the Fe-Ni-Mn alloy is substantially reduced by long
holding times at room temperature
before quenching to the transformation temperature. Further, if after
the isothermal transformation is under way at a temperature T_1 the
sample is upquenched to the ambient temperature and then quenched
to T_1, the transformation rate at T_1 is very much less (by an order of
magnitude) than before the upquenching.

Cohen, Machlin and Paranjpe have suggested that stabilization
results from the lowering of the stress levels at preferential nucleation
sites in γ-iron by a recovery process. That is, stresses are developed in
certain volume elements of the γ due to quenching, for example. α
nuclei probably prefer to form at these stressed sites. These stresses
may be partially relieved by recovery processes so that nucleation of α
is less probable; hence, stabilization occurs.


STRUCTURE STRAIN AND COMPOSITION 

Theory 

An embryo for precipitation is described by its structure, composition, 
size, shape and state of strain. The problem of finding the most probable 
values of these variables for the critical cluster or nucleus is formidably 
complex. The free energy of formation of such a cluster, that is 
incoherent and has the composition of the equilibrium precipitate, is 
given by equation (65), except that now ΔG will be a function of x.
For an oblate spheroidal form we have: 


Y 


, (AG, a . 
AG, = 4na*(1 — e = + Ké*f[(1 — e?)'/] 
t 


(101) 


The free energy of the critical cluster ΔG_i* = f(e*, a*) is found in the
same way as before (see equations (67) and (68)). 

For reasons similar to those already outlined in the section on struc- 
ture and strain, precipitation nuclei may prefer to form coherently or 
semi-coherently in order to minimize their surface energy. The expres- 
sion for the free energy of formation of semi-coherent clusters is 
analogous to equation (70), if the precipitate is softer elastically than 
the matrix, 


AG, = 4ra3(] = e*)'/*! AG, V K'e*f{ (1 ~_ e7)'/*)} 


(102) 


where K′ is the elastic coefficient appropriate to the coherent straining
of the cluster. The free energy of the critical cluster ΔG_i* = f(a*, e*, ε*)
is found by the procedure indicated in the earlier equations, (72)
through (74).

If δ is relatively small and σ is relatively large, clusters having a nearly
spherical shape will form. Following Fisher, Hollomon and Leschen
the free energy of forming a partly coherent spherical cluster is written

$$\Delta G_i = 4\pi r^2 \sigma(\delta - \varepsilon) + (4/3)\pi r^3 (\Delta G_v + c\varepsilon^2) \qquad (103)$$

where ΔG_v = ΔG/V and c is the appropriate elastic coefficient. It
follows that the free energy of formation of the critical cluster is

$$\Delta G_i^* = \frac{16\pi[\sigma(\delta - \varepsilon^*)]^3}{3(\Delta G_v + c\varepsilon^{*2})^2} \qquad (104)$$

where ε* […] (105)

and the condition for coherency […] (106)

When the supersaturation is very large |ΔG_v| ≫ Z and the nucleus
forms coherently. When Z ≫ |ΔG_v|, ε […]
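
The step from the free energy of a spherical cluster to that of the critical cluster can be verified numerically. The sketch below assumes the usual form of a surface term 4πr²σ plus a volume term (4/3)πr³g, with σ standing for the interfacial energy at a fixed coherency strain and g for the sum ΔG_v + cε² (negative when the cluster is stable in bulk). It does not attempt the further minimization with respect to ε, and the function names and numerical values are illustrative only.

```python
import math


def cluster_free_energy(r, sigma, g_volume):
    """Free energy of a spherical cluster: surface term plus volume term."""
    return 4.0 * math.pi * r ** 2 * sigma + (4.0 / 3.0) * math.pi * r ** 3 * g_volume


def critical_cluster(sigma, g_volume):
    """Radius and free energy at the maximum of cluster_free_energy.

    Setting d(dG)/dr = 0 gives r* = -2*sigma/g and
    dG* = 16*pi*sigma**3 / (3*g**2).
    """
    if g_volume >= 0.0:
        raise ValueError("no critical cluster unless the volume term is negative")
    r_star = -2.0 * sigma / g_volume
    dg_star = 16.0 * math.pi * sigma ** 3 / (3.0 * g_volume ** 2)
    return r_star, dg_star


if __name__ == "__main__":
    # Illustrative SI values: sigma in J/m^2, g_volume in J/m^3.
    r_star, dg_star = critical_cluster(0.2, -1.0e9)
    print(r_star, dg_star, cluster_free_energy(r_star, 0.2, -1.0e9))
```

Because the barrier varies as σ³/g², a modest lowering of the interfacial energy by coherency can outweigh the strain energy added to the volume term, which is the competition the text describes.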

One important consequence of the theory of precipitation is that the
coherent precipitate can form and grow only at temperatures well below
the temperature at which the completely non-coherent precipitate is in
equilibrium with the solid solution. This consequence was recognized by
Nabarro and Barrett and has been treated in more detail by Leschen and
Fisher. Following Fisher, Hollomon and Leschen the free energy relations
between solid solution, α, and coherent, β_c, and non-coherent, β, phases
are shown schematically in Fig. 19. The free energy of β_c at a given x
exceeds that of β by an amount cδ². The temperature at which β_c is in
equilibrium with α is ΔT lower than the temperature at which α and β
are in equilibrium.

Fig. 19. Comparison of free energy-composition dependence of coherent
and non-coherent phases. After Fisher, Hollomon and Leschen

As pointed out by Nabarro, where δ is relatively large, e.g. δ ≥ 0.10,
it is sometimes not thermodynamically possible to form a completely
coherent precipitate.

The problem becomes additionally complex when the cases of rigid
and semi-rigid precipitates are taken into account. Then the shape and
energy of the particle will depend on the relative elastic moduli and the
anisotropy of the surface energy.


Comparison of Experience and Theory

To the best of the authors' knowledge there have been few quantitative
measurements of the nucleation frequency of precipitate from solid
solutions. However, some generalizations can be made from qualitative
observations. Geisler has emphasized that the vicinity of grain
boundaries and slip bands are preferred sites for the nucleation of
precipitate when the supersaturation is not large. Thus non-equilibrated
structural defects, such as grain boundaries and possibly dislocations,
apparently catalyze nucleation of precipitate. The preference of nuclei
for grain boundary sites is understood, since grain boundary energy is
released (see equation (61)) by the formation of a cluster thereon. It is
not known to what extent nucleation of precipitate is catalyzed by
accidental inclusions.

A second important generalization is that there exists always a 
definite crystallographic orientation relation between the precipitate 
particles and the matrix. In general, this relation is that the planes and 
directions in which the atomic packing is most similar are parallel in the 
two lattices. For example, in the precipitation of a face-centred cubic 
precipitate from a face-centred cubic matrix the cube edges and cube 
planes in the two lattices are usually parallel. This result might be
explained on the basis either that only the particles with the preferred
orientations nucleate or that only they grow rapidly. However, on the
basis of the theory that has been developed there is good reason for
expecting the nucleation of preferred orientations.*

From the dislocation theory of interphase boundaries it is expected
that their interfacial energy will be a minimum when δ is a minimum.
Therefore, on the basis of surface energy considerations only, clusters
having an orientation relation with the matrix that minimizes δ will
form preferentially. The observed orientation relationships for
precipitation are in general consistent with the principle that δ be a minimum
for low index planes.

It seems probable that the dominant factor controlling the orientation
relation of an incoherent nucleus with the matrix is surface energy and
that strain energy will have little effect. This conclusion follows from
the Nabarro theory that the major part of the strain is generally taken
up by the matrix when incoherent clusters are formed. However, when
the nucleus is partly coherent with the matrix it appears that the
orientation relation will be controlled by the relation of the elastic
coefficients and interfacial energy to crystal direction. For if the
coherency strain is taken up by the cluster it is evident that the
coherency plane relation corresponding to the minimum c will minimize
the strain energy. The orientation relation that minimizes the strain
energy of the critical cluster does not necessarily minimize the interfacial
energy.

* It is certain that an "oriented growth" theory cannot account for preferred
orientations of precipitates in a cubic matrix when the growth is diffusion controlled.

In summary the problem of nucleation of precipitates from solid
solution is very complex and the most useful approximations have not
yet been established. Even so, the general nature of such precipitation
can be described. The precipitation process that will be observed at any 
temperature below the solvus will be the one that occurs at the maxi- 
mum rate at that temperature. At temperatures just below the solvus 
the precipitation will occur heterogeneously and incoherently. If the 
temperature is further lowered, the rate of overall precipitation can be 
increased either by an increase in the rate of nucleation or by an increase 
in the rate of growth. Such an increase can be accomplished by adjust- 
ing the composition or structure to minimize the surface energy between 
precipitate and matrix or by adjusting the composition and structure 
to allow a more rapid rate of growth of the precipitate. One way in 
which the surface energy is reduced is by the formation of coherent
precipitates which can occur only at temperatures sufficiently below the
solvus to make the free energy available for straining the precipitate.
Thus at temperatures below that at which incoherent precipitation 
occurs coherent homogeneous nucleation will occur at a more rapid rate. 
At still lower temperatures, the rate of overall transformation can be 
further increased by allowing the precipitate to form with the same 
composition as the matrix thus eliminating the necessity for long-range 
diffusion and forming a coherent particle in such a way that transport 
of atoms across the interface can occur rapidly. This latter type of 
transformation has been given the name martensite. Thus three distinct
types of precipitation are possible: first, coherently nucleated;
second, incoherently nucleated; and third, martensitic. There are obviously
intermediate types, for example, when the precipitation occurs partially
coherently.

There is much valuable data on orientation relations in precipitation 
that can be qualitatively explained. Also there is reliable information on 
the coherency relations in many systems. Unfortunately, there are few 
data on the isothermal course of precipitation as a function of time and 
even less quantitative data on the nucleation frequency in precipitation 
from solid solution. There is a great need for such data to test the 
prediction of current theories of nucleation of precipitate and to aid 
their further development.